4115-181 



SECTION III 
REMARKS 

Regarding the Amendments 

Claims 1-4, 6, 7, 9-13, 15, 16, 18, 20-22, 25, 37, 38, 44-50, and 53-59 have been amended as set 
forth in the above Complete Listing of the Claims. As amended, the claims are supported by the 
specification and the original claims. In general, the claims have been amended to clarify the 
claim language and are supported by the original claim language. The withdrawn claims have 
been amended similarly to the claims under examination, in order to maintain the complete claim 
set with consistency. 

Throughout the claim set clarification has been made to show that the prodomain protein as a 
claimed element binds with high affinity to a protease or variant thereof. Such high affinity is 
specifically described in the specification at p. 14, 11. 31-33 and p. 18, 11. 22-24. Additionally, the 
claim set has been amended to show that the claimed modified prodomain proteins have 
improved affinity as compared to an unmodified prodomain protein. Such improved affinity is 
specifically described in the specification at p. 15, 11. 1-3. 

No new matter has been added, as defined by 35 U.S.C. § 132. 

Thus, upon entry of the amendments, claims 1-61 will remain pending, of which claims 18-45 
and 50-61 are withdrawn. 

Objections to the Disclosure and Claims - Informalities 

The examiner has objected to the disclosure based on numerous informalities. Applicant 
respectfully submits that all such informalities are addressed by the present Response and 
withdrawal of the objection is specifically requested. 

The paragraph on page 21 has been amended to indicate that the SI 49 mutant contains a deletion 
of residues 75-83 as compared with the wild type. Rather than use the symbol A as indicated by 
the examiner, we have added the language "deletion of. Accordingly, the specification no longer 
contains this informality. 

Additionally, the examiner has objected to claims 1, 4, 6, 10, 11, 12, 16, 46 and 49 as containing 
informalities. The examiner's attention is respectfully drawn to Section II, Amendments to the 
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Claims above, where all of the claim informalities mentioned by the examiner have been 
amended. None of the informalities listed by the examiner remain in the claims, as pending. 

Specifically, 

• Claim 1 has been amended to include the article "a" before both "coding sequence" 
and "protease"; 

• Claim 4 has been amended to include spacing between "Y" and "H," "or" and "L," and 
"for" and "PI"; 

• Claim 6 has been amended to add a comma between "Y" and "H"; 

• Claim 10 has been amended such that the language that now precedes the amino acid 
sequence recites ". . .comprises the amino acid sequence. . ."; 

• Claims 11 and 16 have been amended to properly include the terms "alpha" and 
"membrane"; 

• Claim 12 has been amended to remove the term "for" following the term "encoded"; 

• Claim 46 has been amended to include the article "a" prior to "coding sequence"; and 

• Claim 49 has been amended to include a comma between "V" and "or." 

To the extent applicable, the same changes have been made to withdrawn, pending claims. All 
informalities noted by the examiner have been corrected. Withdrawal of the objections is 
respectfully requested. 



Ob jections to the Disclosure and Claims - Essential Material 

The examiner has objected to the disclosure as lacking essential material in that the structure of 
the sequence "pR8" is not clear from the specification. Applicant respectfully draws the 
examiner's attention to description of pR8 in the specification in the paragraph spanning pages 
20-21: 

Cloning and Expression of the Prodomain of Subtilisin The 
prodomain region of the subtilisin BPN'gene was subcloned 
using the polymerase chain reaction as described in Strausberg, 
et al. [138]. Mutagenesis of the cloned prodomain gene was 
performed according to the oligonucleotide-directed in vitro 
mutagenesis system, version 2(Amersham International pic) 
Example 1 To demonstrate the feasibility of prodomain-directed 
processing, a gene was constructed to direct the synthesis of a 
fusion of the pR8 prodomain onto the N-terminus of the 56 
amino acid B domain(GB) of streptococcal Protein G. 
Prodomain pR8, having the mutations at amino acid residues 16- 
21 (QTMSTM) which were replaced with SGIK creating a two 
amino acid deletion in pR8, wherein S replaces Q16, G replaces 
T17, M18I replaces S19 and T20 and K replaces M21; along 
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with additional substitutions A23C, K27Q, V37L, Q40C,H72K 
and H75K is independently stable and binds to subtilisin with- 
100-times higher affinity than the wild type prodomain. Further, 
pR8 thus becomes the cognate sequence specifying the subtilisin 
cleavage site. 

SEQ ID NO: 2 is identified on page 4 of the application as the 77 amino acid prodomain of 
subtilisin. The above-cited paragraph defines pR8 in relation to the subtilisin prodomain. 
Accordingly, the sequence of pR8 is the same as SEQ ID NO: 2 with the following mutations: 
SGIK for QTMSTM at positions 16-21, A23C, K27Q, V37L, Q40C, H72K and H75K. 

Additional pR8 mutations are described in the specification, pR8FKAM, where the 4 terminal 
amino acids of pR8 (AKAY) are replaced with FKAM and pR58 (or pR8FRAM), where the 4 
terminal amino acids of pR8 (AKAY) are replaced with FRAM. 

All three of these sequences have been included in the Revised Sequence Listing provided 
herewith. pR8 is included as SEQ ID NO: 4, pR8FKAM is included as SEQ ID NO: 5 and pR58 
is included as SEQ ID NO: 6. 

Withdrawal of the objection is respectfully requested. 

Objections to the Disclosure and Claims - Lack of Sequence Rules Compliance 

The examiner has alleged that the application is not in compliance with the requirements of 37 
C.F.R. §1.821, as applicable to an application containing nucleotide and/or amino acid 
sequences. Specifically, the examiner points to: 

• The nonapeptide in claim 10 and at page 6, line 21 and page 25, line 6; 

• The hexapeptide at page 20, line 33; and 

• Pentapaptides in the final paragraph of page 21 and on page 22 

as sequences that should be included in the Sequence Listing. In Response, the applicant 
respectfully draws to the examiner's attention to the Revised Sequence Listing provided 
herewith, where new sequences SEQ ID NOs 4-8 have been added. Specifically, SEQ ID NO: 4 
is the sequence of pR8, SEQ ID NO: 5 is the sequence of pR8FKAM, SEQ ID NO: 6 is the 
sequence of pR58 (or pR8FRAM), SEQ ID NO: 7 is the nonapeptide in claim 10 and at page 6, 
line 21 and page 25, line 6 and SEQ ID NO: 8 is the hexapeptide at page 20, line 33. The 
pentapeptides in the paragraph spanning pages 21 and 22 have not been included, as it is 
submitted that these are names of specific substrates, not actual pentapeptide sequences. 
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Entry of the Revised Sequence Listing and withdrawal of the objection is respectfully requested. 

Re jection of Claims Under 35 U.S.C. § 112 

• Structural Relationship 

In the Office Action mailed May 13, 2008, the examiner has rejected claims 1-17 and 46-49 
under 35 U.S.C. §112, second paragraph as allegedly missing an essential structural cooperative 
relationship of elements. In support of this allegation, the examiner states that the claims are 
incomplete for "omitting an essential structural cooperative relationship of elements...," citing 
MPEP §2172.01, which provides: 

"In addition, a claim which fails to interrelate essential elements 
of the invention as defined by applicant(s) in the specification 
may be rejected under 35 U.S.C. 112, second paragraph, for 
failure to point out and distinctly claim the invention. See In re 
Venezia, 530 F.2d 956, 189 USPQ 149 (CCPA 1976); In re 
Collier, 397 F.2d 1003, 158 USPQ 266 (CCPA 1968)..." 

Applicant respectfully disagrees that the claims fail to satisfy the written description 

requirement. 

The standard for complying with the written description requirement, as stated in MPEP 2163 is 
that "a patent specification must describe the claimed invention in sufficient detail that one 
skilled in the art can reasonably conclude that the inventor had possession of the claimed 
invention." Applicant respectfully submits that the claimed invention is so described in the 
present application. 

Independent claim 1 recites a nucleic acid construct encoding a fusion protein, where the fusion 
protein has two elements, a protein of interest and a prodomain protein. Those elements are 
recited as "operatively linked" in the encoded fusion protein. Independent claims 7, 12, and 13 
also contain this language. Furthermore, all of the pending independent claims containing the 
"operatively linked" language are noted to recite functional language of the fusion protein: claim 
1: "...wherein the prodomain protein has binding with high affinity to a protease or a variant 
thereof; claim 7: "...wherein the prodomain protein is modified to exhibit an increased affinity 
for subtilisin or a variant thereof..."; claim 12: "...having binding with high affinity for 
subtilisin. . ."; claim 13: ". . .modified to bind subtilisin or a variant thereof. . .". 
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Use of the term "operatively linked" clearly provides that the elements of the fusion protein 
recited in the claims must be linked in a manner that allows existence of the recited functionality. 
One of skill in the art would clearly understand that joining of the elements of the fusion protein 
in a manner that prevented the recited functionality would result in a fusion protein outside the 
scope of the claimed invention. Therefore, recitation of the orientation of the two elements of 
the fusion protein in the claims is not necessary. 

In the specification, applicant has provided numerous examples of fusion proteins, as recited in 
the claims. The examples focus on inclusion of prodomain portions that have a high affinity for 
subtilisin, but the claims of the invention are not so limited. While a fusion protein containing a 
subtilisin prodomain portion may have the necessity of being amino-proximal with respect to the 
protein of interest element of the fusion protein, a prodomain specific for another protease may 
need to be positioned differently. Therefore, inclusion of the orientation in the claims would be 
unnecessarily limiting. 

The examiner's attention is respectfully drawn to the following relevant law on the written 
description requirement of 35 U.S.C. § 112 first paragraph, as declared by the Federal Circuit. 
"The specification need not describe the claimed subject matter in exactly the same term[s] as 
used in the claims; it must simply indicate to persons skilled in the art that as of the [filing] date 
the applicant had invented what is now claimed." Eiselstein v. Frank, 52, F.3d 1035, 1038, 34 
USOQ2d 1467, 1470 (Fed. Cir. 1995). More recently, with regard to the biochemical arts, the 
Federal Circuit has held that, "(1) examples are not necessary to support the adequacy of a 
written description; (2) the written description standard may be met ... even where actual 
reduction to practice of an invention is absent; and (3) there is no per se rule that an 
adequate written description of an invention that involves a biological macromolecule must 
contain a recitation of known structure." (emphasis added) Falkner v. Inglis, 448 F.3d 1357, 
1366, 79 USPQ2d 1001, 1007 (Fed. Cir. 2006). 

In accordance with this case law, from the examples provided by applicant in the application, 
fusion proteins encompassing prodomains with affinity for proteases other than subtilisin are 
within the scope of the claims and the claims satisfy the written description requirement. 

Importantly the fusion proteins recited in the claims, whether as an expression product of a 
claimed nucleic acid construct, or as used in a claimed method, must contain two essential 
elements: 1) a protein of interest (or "target" protein) and 2) a prodomain protein. 
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MPEP §2172.01 cited by the examiner also cites the Ex parte Nolden case. The examiner's 

attention is respectfully drawn to a portion of that case not cited in the MPEP where the court, in 

its analysis noted that: 

"The claimed functional relationship gives life and meaning to 
the claimed structure of the adapter. . ." 

And in supporting a finding that a portion of the claims satisfied the written description 

requirement: 

"...we find no ambiguity as to what structure and structural 
relationships are intended to be covered by the claims. In other 
words, we consider the claims as 'particularly pointing out and 
distinctly claiming the subject matter which the applicant 
regards as his invention,' as required by 35 U.S.C. 112." 

The pending claims of the present application provide a clear structural relationship between the 

elements of the recited fusion protein; those elements must be operatively linked and the 

resulting fusion protein must contain a prodomain protein portion that has high binding affinity 

for a protease. Accordingly, one of skill in the art can reasonably conclude that the inventor had 

possession of the claimed invention at the time of filing, without inclusion of a recitation of the 

orientation of the elements of the fusion protein. Withdrawal of the rejection is therefore 

respectfully requested. 

• Relative/Functional Terms 

Additionally the examiner has rejected claim 1 under 35 U.S.C. § 112, second paragraph as 
indefinite for failing to provide a basis for "increased affinity" and for "corresponding." As 
amended, claim 1 no longer contains either of these terms. Accordingly, removal of the rejection 
is respectfully requested. 

• Relative Term 

Claim 10 is also rejected under 35 U.S.C. § 112, second paragraph as indefinite for containing 
the term "cognate." As amended, claim 10 no longer contains this term. Accordingly, removal 
of the rejection is respectfully requested. 

• Distinction Between Elements 



19 



4115-181 



The examiner has also rejected claim 46 under 35 U.S.C. § 112, second paragraph as indefinite 
for failing to provide a distinction between the two fusion partners of the fusion protein. The 
examiner's attention is respectfully drawn to the amended language of claim 46, as set forth 
above, where the claim now recites the elements "a protein of interest" and "a second protein." 
As amended it is clear that it is the second protein that "generates affinity" for a protease or 
variant thereof. Withdrawal of the rejection in light of the clarity of the amended language is 
respectfully requested. 

Rejection of Claims Under 35 U.S.C. § 102 

In the Office Action mailed May 13, 2008, the examiner has rejected claims 1, 46 and 47 under 
35 U.S.C. §102(e) as anticipated by U.S. Patent Publication No. 2003/0166162, hereinafter 
referred to as "Van Roojen et al." Applicant respectfully disagrees. 

Anticipation of a claim requires the disclosure in a single prior art reference of each element of 
the claim under consideration. ( Verdegaal Bros, v. Union Oil Co. of California , 814 F.2d 628, 
631, 2 USPQ2d 1051, 1053 (Fed. Cir. 1987.) 

The examiner points to Van Roojen et al. as disclosing 

"the preparation of polynucleotides that encode fusion 
polypeptides comprising any one of several modified chymosin 
prodomains fused directly to the amino terminus of any one of 
several desired, carboxyl-proximal, polypeptide fusion partners 
- hormones - wherein the modified prodomains present in 
recombinantly expressed fusion polypeptides are disclosed to 
have an improved affinity for chymosin as shown by increased 
yields of cleaved fusion partner achieved, relative to that 
achieved with an unmodified chymosin prodomain. . ." 

The present invention recites a construct encoding a fusion protein where the fusion protein 

comprises, in part, a prodomain protein with a high binding affinity for a corresponding protease 

or variant thereof. It is respectfully submitted that the constructs of Van Roojen et al. do not 

possess prodomain portions with the characteristics of the prodomains recited in the claims of the 

present application . 

Claims 1, 46 and 47 recite a construct encoding a fusion protein comprising a prodomain that has 
a high affinity for a protease or variant thereof. In the specification this characteristic is defined 
as 
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"[t]he phrase 'binding with high affinity' as used herein refers to 
the ability of the protease prodomain to bind to the cognate 
protease with a Kd of nM to pM and ranging from about 10 nM 
to about 10 pM, preferably < 100 pM." 

Specifically, "affinity" is the ability to bind and stay bound and is evidenced by a low 

disassociation constant (and therefore high affinity constant). This is demonstrated in the 

examples of the specification where it is shown that the fusion proteins are used to purify 

products, where the prodomain binds a protease tightly, then the bound protease and the protein 

of interest are cleaved. The protein of interest is recoverable and the bound prodomain is 

disabled, as it remains bound to the protease. Therefore "affinity" as used in claims 1 and 46 

describe a prodomain protein that can bind tightly and remain bound to a protease. 

In support of his position, the examiner directs the applicant's attention to Table 2 of Van Roojen 
et al., concluding that this table shows improved affinity due to the showing of increased yields 
of cleaved fusion partner. Applicants respectfully disagree. Table 2 shows the results of 
Example 4 of the application. In that Example, a GST-cystatin fusion protein is generated, with 
chymosin pro-peptides inserted in various locations to form different fusion proteins (shown in 
Figs. 13-17: GST-KLIP4-Cystatin, GST-KLIPll-Cystatin, GST-KLIP12-Cystatin, GST-KLIP14- 
Cystatin, GST-KLIP15-Cystatin, and GST-KLIP16-Cystatin). Chymosin is added to each of the 
fusion proteins and cleaves the fusion protein at the pro-peptide sites. The chymosin does not 
bind to the KLIP pro-peptide with high affinity, but associates only transiently as is typical of 
enzyme-substrate-product interactions. This allows release of chymosin after cleavage, where it 
is free to cleave additional fusion proteins. The results of this action are what are shown in Table 
2. It is clear from the results (with the exception of KLIP 12) that the longer the reaction is 
allowed to proceed, the chymosin continues to react with additional fusion peptides in the 
solution and more free cystatin is generated. 

Therefore the results shown in Table 2 of Van Roojen et al. demonstrate the "catalytic turnover" 
of the fusion protein generated, rather than the affinity itself. Catalytic turnover quantifies the 
results of the reaction between KLIP and chymosin. Table 2 shows the amount of free cystatin 
generated as multiple reactions are allowed to occur between the fusion proteins (at the KLIP 
site) and a determinate amount of chymosin. Van Roojen et al. do not address the affinity 
between KLIP and the chymosin. As described above and as utilized in the claims, affinity is a 
measure of the binding of two components. Therefore, Van Roojen et al. do not provide a 
showing of a construct encoding a fusion protein comprising a prodomain protein with a high 
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affinity for a protease. 

As Van Roojen et al. do not describe a nucleic acid construct as set forth in claims 1, 46 or 47, 
Van Roojen et al. do not anticipate the claimed invention. Accordingly, withdrawal of the 
rejection of claims 1, 46, and 47 under 35 U.S.C. § 102(e) as being anticipated by Van Roojen et 
al. is respectfully requested. 

Re jection of Claims Under 35 U.S.C. § 103 

In the Office Action mailed May 13, 2008, the examiner has rejected claims 2, 3, 5, 7, 8, 11, 12, 
13, 14, 17, and 48 under 35 U.S.C. §103(a) as obvious over Van Roojen et al. in view of Gr0n et 
al., "Studies of binding sites in the subtilisin from Bacillus lentus by means of site directed 
mutagenesis and kinetic investigations," Subtilisin Enzymes: Practical Protin Engineering, Bott, 
R. and Betzel, C, Eds., p. 105-112, 1996 (hereinafter Gr0n et al.). Applicant respectfully 
disagrees. 

It is elemental law that in order for an invention to be obvious, the difference between the subject 
matter of the application and the prior art must be such that the subject matter as a whole would 
have been obvious at the time the invention was made to a person of ordinary skill in the art. In 
order to meet this standard for a proper §103 rejection, all claim limitations must be disclosed or 
derivable from the cited combination of references, there must be a logical reason to combine the 
cited references to produce an operable combination and there must be a reasonable expectation 
of success. (MPEP §2143) 

Claims 2, 3, 5, 7, 8, 11, 12, 13, 14, 17, and 48 recite nucleic acid constructs, fusion proteins and 
methods of using the same. However, all of these claims contain as an element, a fusion protein 
comprising a prodomain protein with a high affinity for a protease. The combination of Van 
Roojen et al. and Gr0n et al. does not disclose or describe such a fusion protein. In particular, 
both Van Roojen et al. and Gr0n et al. provide showings including catalytic turnover or catalytic 
efficiency of a substrate. Such a showing does not correlate directly to or even relate to the 
characteristic of affinity as recited in the present claims. 

As set forth in detail above with respect to the rejection based on 35 U.S.C. §102, it is 
respectfully submitted that Van Roojen et al. do not provide a showing of a construct encoding a 
fusion protein comprising a prodomain protein with a high affinity for a protease. 
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The examiner cites Gr0n et al. as demonstrating "the improvement of the affinity of a subtilisin 
having an amino acid sequence modification within the protease subsite..." In support of his 
position, the examiner draws the applicant's attention to Table 3 at page 110 of Gr0n et al., 
stating that it shows "the binding affinity, measured by rate of proteolysis." Applicant 
respectfully submits that Table 3 does not demonstrate the results alleged by the examiner. 

It is well established that affinity and catalytic efficiency (or catalytic turnover) are separate 
considerations and are not interchangeable determinations. (See Strausberg, S.L. et al. "Directed 
Coevolution of Stability and Catalytic Activity in Calcium-free Subtilisin," Biochemistry , 2005 
Mar 8;44(9):3272-9; Hedstrom, L. (2002) "Serine protease mechanism and specificity." Chem 
Rev 102, 4501-4524; attached hereto as Exhibits A and B.) Table 3 of Gr0n et al. shows the 
catalytic efficiency (kc at /K M ) values for the hydrolysis of fluorogenic peptide substrates. As is 
well known in the art, k cat is the overall catalytic rate of an enzyme, calculated as the number of 
substrate molecules converted to product by each catalytic site as a function of time. K M is a 
measure of the affinity of a substrate for an enzyme, known as the Michaelis-Menten constant. 
The measure of catalytic efficiency, as provided in Gr0n et al. is a measure of the substrate's 
ability to interact with an enzyme. It is a measure of the reaction between those two, not the 
ability to bind and stay bound. The results in Table 3, therefore demonstrate the ability of that 
substrate to participate in a reaction and produce product. 

Van Roojen et al. and Gr0n et al. fail to provide any derivative basis for the claimed invention. In 
combination, the two references fail to provide a showing of fusion protein comprising a 
prodomain protein with a high affinity for a protease. Accordingly, no basis of prima facie 
obviousness of the claimed invention is presented by such cited references. 

As Van Roojen et al. in view of Gr0n et al. does not provide any logical basis for the constructs, 
fusion proteins, or methods recited in claims 2, 3, 5, 7, 8, 11, 12, 13, 14, 17, and 48, Van Roojen 
et al. in view of Gr0n et al. does not render the claimed invention obvious. Accordingly, 
withdrawal of the rejection of claims 2, 3, 5, 7, 8, 11, 12, 13, 14, 17, and 48 under 35 U.S.C. § 
103 (a) as being obvious over Van Roojen et al. in view of Gr0n et al. is respectfully requested. 

CONCLUSION 

Based on the foregoing, all of Applicant's pending claims 1-17 and 46-49 are patentably 
distinguished over the art, and are in form and condition for allowance. The Examiner is 
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requested to favorably consider the foregoing and to responsively issue a Notice of Allowance. 

The time for responding to the May 13, 2008, Office Action without extension was set at three 
months, or August 13, 2008. This response is therefore timely and no fees are believed to be due 
for the filing of this paper. However, should any fees be required or an overpayment of fees 
made, please debit or credit our Deposit Account No. 08-3284, as necessary. 

If any issues require further resolution, the Examiner is requested to contact the undersigned 
attorneys at (919) 419-9350 to discuss same. 

Respectfully submitted, 



Date: August 12, 2008 /steven j. hultquist/ 

Steven J. Hultquist 
Reg. No. 28,021 
Attorney for Applicants 



Date: August 12, 2008 /kelly k. reynolds/ 

Kelly K. Reynolds 
Reg. No. 51,154 
Attorney for Applicants 
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1 Directed Coevolution of Stability and Catalytic Activity in Calcium-free Subtilisin 1 ^ 

2 Susan L. Strausberg, Biao Ruan, Kathryn E. Fisher, Patrick A. Alexander, and Philip N. Bryan* 

3 Center for Advanced Research in Biotechnology, University of Maryland Biotechnology Institute, 

4 9600 Gudelsky Drive, Rockville, Maryland 20850 

5 Received October 13, 2004; Revised Manuscript Received December 17, 2004 

6 abstract: We have coevolved high activity and hyperstability in subtilisin by sequentially randomizing 

7 12 amino acid positions in calcium-free subtilisin. The optimal amino acid for each randomized site was 

8 chosen based on stability and catalytic properties and became the parent clone for the next round of 

9 mutagenesis. Together, the 12 selected mutations increased the half-life of calcium-free subtilisin at elevated 

10 temperature by 15 000-fold. The catalytic properties of the mutants were examined against a range of 
n substrates. In general, only mutations occurring at or near the substrate -binding surface have measurable 

12 effects on catalytic constants. No direct influence of stability on catalytic properties was observed. A 

13 high-stability mutant, Sbtl40, was a more efficient enzyme in terms of k c JK m than a commercial version 

14 of subtilisin across a range of substrates but had a lower k CSLt against tight-binding substrates. The reason 

15 for this behavior was discerned by examining microscopic rate constants for the hydrolysis of a tight- 

16 binding peptide substrate. Burst kinetics were observed for this substrate, indicating that acylation is not 

17 rate-limiting. Although acylation occurs at the rate of substrate binding, & cat is attenuated by the slow 

18 release of the N-terminal product. Natural evolution appears to have optimized catalytic activity against 

19 a range of sequences by achieving a balance between substrate binding and the rate of release of the 

20 N-terminal product. 

21 Subtilisin BPN' from Bacillus amyloliquefaciens is a N218S, and Q271E). This mutant is referred to as Sbt88. 47 

22 secreted, serine protease, which degrades proteins in the The directed evolution of calcium-free subtilisin has been 48 

23 extracellular environment and provides amino acids to the described previously (72). The X-ray crystal structure of 49 

24 Bacillus (1—3). It is an important industrial enzyme as well Sbt88 subtilisin (PDB ID 1DUI) is described in Almog et so 

25 as a model system for protein engineering studies (4). al. (13). 51 

26 Subtilisins from mesophilic organisms are generally highly Xhe i 00 p-deleted background has become a platform for 52 

27 efficient catalysts, turning over a broad spectrum of protein further engineering because stability can be studied inde- 53 

28 substrates at rates exceeding 10 6 M 1 s 1 at 25 °C (5). pen dent of calcium binding and because of the potential 54 

29 Thermophilic subtilisins generally are less efficient. One of usefulness of chelant-stable subtilisins for biotechnological 55 

30 the practical goals of protein-engineering efforts is to create applications. This paper describes the identification of 12 56 

31 subtilisins that are both highly stable and highly efficient additional stabilizing mutations by site-directed random 57 

32 catalysts (<5, 7). Here, we evolve a hyperstable subtilisin and mutagenesis in the loop-deleted background. Collectively, 58 

33 measure catalytic properties as a function of stabilizing the 12 mutations increase enzymatic half -life by 15 000 times 59 

34 mutation. at temperatures >65 °C. The goal of this paper is to examine 60 

35 The variants examined here lack the calcium-binding loop interrelationships between stability and catalytic properties. 61 

36 A of natural subtilisins. Site A is formed from a nine amino 

37 acid bubble in the last turn of an a helix, comprising amino MATERIALS AND METHODS 62 

38 acids 63—85 (helix C) (8). In previous work, we have shown 

39 that deleting amino acids 75—83 creates an uninterrupted Expression and Purification of Subtilisins. The subtilisin 63 

40 helix with normal helical geometry over its entire length and gene from Bacillus amyloliquefaciens (subtilisin BPN') has 64 

41 abolishes the calcium-binding potential at site A (9—11). been cloned, sequenced, and expressed at high levels from 65 

42 Nevertheless, regions of subtilisin that were adjacent to the its natural promoter sequences in Bacillus subtilis (2, 3). All 66 

43 75—83 loop have lost many favorable interactions. Conse- mutant genes (encoding the signal peptide, prodomain, and 67 

44 quently, a number of compensating mutations were identified mature subtilisin) were recloned into a pUBl 10-based 68 

45 and introduced into the deletion mutant to restore stability expression plasmid and used to transform B. subtilis. The 69 

46 (Q2K, 1 S3C, P5S, K43N, M50F, A73L, Q206C, Y217K, B. subtilis strain used as the host contains a chromosomal 70 
deletion of its subtilisin gene and therefore produces no 71 

t This work was supported by NIH Grant GM42560. background wild-type activity (14). Subtilisin varaints were 72 

* To whom correspondence should be addressed. E-mail: expressed in a 5 L New Brunswick fermentor at a level of 73 

bryan@umbi.umd f eda Telephone: 301-738-6220. Fax: 301-738-6255. > m ^ Fermentation conditions and purification were 74 

1 A shorthand for denoting amino acid substitutions employs the . r . 

single-letter amino acid code as follows: Q2K denotes the change of as described (15). The enzyme concentration (E) was 75 

glutamine at position 2 to lysine. determined using £0.1% = 1.17 at 280 nm (1). For variants 76 

10.1021/bi047806m CCC: $30.25 © xxxx American Chemical Society 
Published on Web 00/00/0000 PAGE EST: 7.2 
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77 that contain a Y217 mutation, the E 0.1% at 280 nm was 

78 calculated to be 1.12 based on the loss of one Tyr residue 

79 (16). 

80 Construction of Active Calcium-free Subtilisin Genes. 

81 Because calcium-free subtilisin is lethal to Escherichia coli, 

82 mutagenesis to create genes for active calcium-free subtilisin 

83 had to be carried out in such a way to prevent leaky 

84 expression in the steps in which E. coli was used as a host. 

85 To separate the A75— 83 subtilisin gene into nonfunctional 

86 pieces, an Mlu I site was introduced at the junction between 

87 the prosequence and the mature portion in the expression 

88 plasmid for the inactive calcium-free subtilisin Sbtl5 (pS 15) 

89 (9). Subtilisin Sbtl5 is inactive because of a mutation of 

90 S221C. An Mlu I site was also introduced into Ml 3 mpl9. 

91 The portion of Sbtl5 subtilisin gene encoding only mature 

92 subtilisin was then subcloned as a 1.2-kb Mlu I to Sal I 

93 fragment into the altered Ml 3 mpl9. The ml3 clone was 

94 mutagenized to convert cysteine 221 back to serine. This 

95 construction was incapable of producing active subtilisin 

96 because it contained neither the expression signals nor the 

97 prosequence. The mutagenized Mlu I to Sal I fragment, 

98 containing the mature portion of the active gene, was ligated 

99 into the expression plasmid and cloned directly into B. 

100 subtilis. 

101 Directed Mutagenesis and Screening for Increased Stabil- 

102 ity. Site-directed mutagenesis was performed on subtilisin 

103 genes with oligonucleotides, which were degenerate at a 

104 chosen codon using QuikChange (Stratagene) mutagenesis 

105 kits according to the instructions of the manufacturer. The 

106 degenerate codon contained all combinations of the sequence 

107 NNB, where N is any of the four nucleotides and B is T, C, 

108 or G. The 48 codons represented in this population encode 

109 for all 20 amino acids but exclude the ochre and amber 
no termination codons. The mutagenized gene was ligated into 
in an expression plasmid and used to transform B. subtilis. To 
112 have a 98% chance of finding an amino acid represented 
us only once in the population of codons NNB (W, E, Q, or 

114 M), one must screen about 200 mutant clones. Codons for 

115 all other amino acids are represented by at least two codons 
lie in the population and would require screening of about 100 

117 mutant clones to have a 98% chance of being represented in 

118 the mutant population. A total of 200—300 clones were 

119 screened in each experiment. 

120 Mutant clones were grown in microtiter dishes and 

121 screened for proteolytic activity using 1% skim milk. A total 

122 of 50 fiL of 2% yeast extract was dispensed in each of the 

123 96 wells of a microtiter dish. Each well was inoculated with 

124 a Bacillus transformant and incubated at 37 °C with shaking. 

125 After 18 h of growth, 5 fiL of culture was diluted into 

126 95 fiL of 100 mM tris(hydroxymethyl)aminomethane (Tris) 2 - 

127 HC1 and 100 mM NaCl at pH 8.0 and 1% skim milk in a 

128 second microtiter dish. The wells in which the turbidity 

129 (absorbency at 600 nM) decreased the fastest were presumed 

130 to contain the most active subtilisin mutants. 

131 Determination of Inactivation Rates. The kinetics of 

132 inactivation were determined as follows. Subtilisin at a 



2 Abbreviations: sAAF-AMC, succinyl-Ala-L-Ala-L-Phe-7-amino- 
4-methylcoumarin; sAAPF-pNA, succinyl-L-Ala-L-Ala-L-Pro-L-Phe-p- 
nitroanilide; sDVRAF-AMC, succinyl-L-Asp-L-Val-L-Arg-L-Ala-L-Phe- 
7-amino-4-methylcoumarin; t m , half-life for a kinetic experiment; Tris, 
tris(hydroxymethyl)aminomethane. 



1 juM concentration was dispensed in aliquots of 0.5 mL 
into 1 mL glass test tubes and covered with Parafilm. The 
tubes were placed in a circulating water bath at the 
appropriate temperature. At each time point, a tube was 
removed and quickly transferred to an ice bath. A 10 pCL 
aliquot was removed, and residual activity was assayed in 
990 fiL of 1 mM succinyl-L-Ala-L-Ala-L-Pro-L-Phe-p-nitroa- 
nilide (sAAPF-pNA), 0.1 M Tris-HCl at pH 8.0, and 0.1 M 
NaCl. The inactivation time course was followed over four 
half-lives (17). 

Kinetic Measurements. Steady-state assays of the hyd- 
rolysis of sAAPF-pNA were as described by DelMar et al. 
(18). Succinyl-DVRAF-7-amino-4-methylcoumarin (AMC) 
was purchased from AnaSpec Inc. The concentrations of the 
AMC substrates were determined by absorbency at 324 nm 
using an extinction coefficient of 16 mM -1 cm -1 . Reaction 
kinetics of AMC substrates were measured using a KinTek 
stopped-flow model SF2001 (excitation X = 380 nm, and 
emission = 400 nm cutoff filter). A stock solution of enzyme 
at a concentration of 2 mM in 10 mM KPi at pH 5.7, 
100 mM NaCl, and 50% glycerol was prepared for kinetic 
studies. Immediately prior to stop-flow mixing, the enzyme 
was diluted into 100 mM Tis-HCl at pH 8.3 and 100 mM 
NaCl and placed in syringe A of the SF2000. A solution of 
the substrate in the same buffer was place in syringe B. 
Solutions were equilibrated at 25 °C. Fluorescence data were 
collected after one to one mixing of the two solutions. For 
measurements of single turnover kinetics, the final enzyme 
concentration was varied from ~0.5— 20 juM with a constant 
substrate concentration of 50 pM. Typically, 10—15 
kinetics traces were collected for each [E\. Kinetic mecha- 
nisms were simulated using KinTekSim (19, 20), which 
was obtained from the KinTek Corporation website 
(www.kintek-corp.com). 

RESULTS 

Directed Mutagenesis and Screening. A total of 12 amino 
acid positions in Sbt88 subtilisin (Table 1) were chosen for 
random mutagenesis based on earlier studies. Amino acid 
position and references are as follows: S9 (7); 131 (27), E156 
(22, 23), G166 (24, 25), G169 (16, 26), S188 (25), N212 
(27), A216 (28), Y217, (16, 26), N218 (29), M222 (30), and 
T254 (25) (Table 1 and Figure 1). These were the only sites 
screened, although positions 8, 14, 27, 53, 63, 89, 109, 116, 
118, 131, 161, 170, 172, 181, 194, 195,213, and 256 would 
also have been good candidates based on earlier reports 
(7, 25, 31-34 ). 

Generating and screening subtilisin variants involved in 
vitro random mutagenesis at each site of the Sbt88 subtilisin 
gene, expression of mutated genes in B. subtilis, and 
screening for activity versus casein as described in the 
Materials and Methods. Once preliminary identification of 
an active mutant was made from the microtiter dish assay, 
the corresponding Bacillus clone was grown for further 
analysis. The DNA sequence of the mutagenized site was 
determined for each clone confirmed to produce a high level 
of casein activity. Active mutants were further analyzed for 
stability and sAAPF-pNA activity. 

The kinetics of thermal inactivation were measured in 
100 mM Tris-HCl at pH 8.3 and 100 mM NaCl at 
temperatures from 65 to 75 °C. The rate of inactivation at 
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Table 1: Mutation in Subtilisin a 



mutation 


Sbt88 


Sbt98 


Sbt99 


Sbtl03 


Sbtl07 


Sbtl09 


SbtllO 


Sbtll6 


Sbtl40 


SbtlSl 


Sbtl53 


Sbtl84 




X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


jjL 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


P^Q 

rDij 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


eoA 




















X 


X 


X 


1 T 












X 


X 


X 


X 


X 


X 


X 


JV'H- J IN 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


ivi jur 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


A71T 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


ZA / J> OJ 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 








X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


G166S 


























A 1 ^QA 




X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


S188P 


















X 


X 


X 


X 


Q206C 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


N212G 
















X 


X 


X 


X 


X 


Y217L 








X 


X 


X 


X 


X 


X 


X 


X 


X 


A216E 






















X 


X 


N218S 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


M222Q 
























X 


T254A 










X 


X 


X 


X 


X 


X 


X 


X 


Q271E 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


X 


a The x denotes that a variant subtilisin contains a specified mutation. 




Figure 1 : Structure of subtilisin S221 A mutant of subtilisin Sbtl 10 
(Gary Gilliland, manuscript in preparation). Sites of random 
mutation are shown by the cyan spheres. Active-site residues D32, 
H64, and S221 are shown in red. Pentapeptide AAAAL bound to 
the binding cleft is shown in yellow. 

193 elevated temperatures and low concentrations (e.g., < 1 fiM) 

194 is characterized by a single-exponential decay curve, whose 

195 time constant is determined predominantly by the rate of 

196 subtilisin unfolding. Thus, mutations that decrease the rate 

197 of inactivation under these conditions can be considered to 

198 have increased the activation energy of unfolding (75, 17). 

199 Steady-state kinetic parameters against the peptide sub- 

200 strate sAAPF-pNA were determined in 100 mM Tris-HCl 

201 at pH 8.3 and 100 mM NaCl at 25 °C. The optimal amino 

202 acid for each randomized site was chosen based on stability 

203 and catalytic properties and became the parent clone for the 

204 next round of mutagenesis. The order of mutation was chosen 

205 arbitrarily and is shown in Table 2. No wild-type BPN' amino 

206 acid was judged to be optimal at any of the 12 positions. 



This was expected because stabilizing mutations at these sites 207 

had been found previously in the wild-type background. The 208 

gene for the initial parent subtilisin (Sbt88) had mutations 209 

at 2 of the 12 sites: Y217K and N218S. The screening re- 210 

identified serine as the optimal amino acid at 218. At position 211 

217, L, V, Q, S, M, and Y were identified in the primary 212 

microtiter dish screen. Leucine at 217 was chosen for 213 

propagation based on its k cs JK m versus sAAPF-pNA, even 214 

though it is less stable than K217 and M217. Positions 166 215 

and 222 also required a compromise between stability and 216 

catalytic activity. Serine was chosen at position 166 because 217 

of the 2. 8 -fold increase in half-life, even though catalytic 218 

turnover is 0.67 that of the wild-type G166. At position 222, 219 

glutamine was identified by random selection but reduced 220 

k c JK m by 10-fold relative to the wild- type methionine. The 221 

M222Q mutant was selected for further study because 222 

removal of the peroxide- sensitive M222 from the active- 223 

site region of subtilisin makes the enzyme virtually insensi- 224 

tive to 1 M hydrogen peroxide (30, 35, 36). Table 2 225 

summarizes the stability and kinetic data for all of the 226 

selected mutants. Mutations occurring at or near the substrate- 227 

binding surface have measurable effects on catalytic con- 228 

stants: I31L, E156S, G166S, G169A, A216E, K217L, and 229 

M222Q. The kinetic effects of these mutations are basically 230 

as anticipated from previous work (7). Mutations distal to 231 

the active site (S9A, S188P, N212G, and T254A) did not 232 

effect steady-state parameters. Contributions from the indi- 233 

vidual stabilizing mutations accrue cumulatively; thus, build- 234 

ing a highly stable subtilisin was accomplished in a step by 235 

step manner by sequentially mutagenizing the Sbt88 gene 236 

and selecting the desired properties. All 12 mutations increase 237 

the half-life of calcium-free subtilisin at elevated temperature 238 

by 15 000-fold. Parallel mutagenesis and screening of 12 239 

Sbt88 gene libraries would have been more expeditious had 240 

we not wished to examine the accumulated effects of 241 

stabilizing mutations on catalytic activity. Optimal mutations, 242 

once identified in the Sbt88 background, could have been 243 

combined to create stable, active mutants because of the 244 

independent effects of individual mutations. The strategy of 245 
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Table 2: Selection for Stability and Catalytic Activity 0 



ha 



parent strain 


random site 


observed mutations 




(65 °C) 


(75 °C) 


!SbtOO 


CO 1 o 




a or 


AC\f\ 


U.J 






r \ 
U 


rk 1 rk 
U.1U 


22U 










nil 


oj 




^Dtoo 


( \ 1 /^Q 


A 
A 


i 
1 


onnn 

2UUU 


_ 


!SDty© — t^Dtoo -r Aloy) 


h,l jo 




.3 




2.4 


Sbtyy — (SbWo + S15o) 


K217 


L 


C 1 

5.1 




1 *7 

1.7 






V 


Z.J 




U.J 






c 


1 /i 
1 .4 




n o 

u.y 






Q 


2.5 




l.J 






M 


2. / 




2.0 






Y* 


5.0 




0.8 


Sbtl03 = (Sbt99+L217) 


T254 


A 


6 




6 


Sbtl07 = (Sbtl03 + A254) 


131 


L 


6 




9.7 


Sbtl09 = (Sbtl07+L31) 


G166 


S 


4 




27 






R 


5 




18 


SbtllO = (Sbtl09 + S166) 


N212 


G 


4 




41 


Sbtll6 = (SbtllO + G212) 


S188 


P 


4 




53 


Sbtl40 = (Sbtll6 + P188) 


S9 


A 


4.2 




95 


Sbtl51 = (Sbtl40 + A9) 


A216 


E 


4.7 




160 


Sbtl53 = (Sbtl51 +E216) 


M222 


Q(Sbtl84) 


0.45 




450 



a Steady-state kinetic parameters versus sAAPF-pNA were measured in 100 mM NaCl and 100 mM Tris-HCl at pH 8.3. Half-lives were measured 
in 100 mM NaCl and 100 mM Tris-HCl at pH 8.0, at an initial enzyme concentration of 1 juM. An asterisk denotes a reversion to the wild-type 
subtilisin BPN' amino acid. 



Table 3: Steady-State Kinetics 0 

Substrate: sAAPF-pNA 



subtilisin 


K m (juM) 




k c JK m (M 1 s 


0 


SBT* 


450 


210 


0.47 x 10 6 




Sbtl40 


48 


190 


4.0 x 10 6 




Sbtl51 


45 


190 


4.2 x 10 6 




Sbtl53 


46 


218 


4.7 x 10 6 




Sbtl84 


300 


134 


0.45 x 10 6 






Substrate: G B F30H 






subtilisin 


K m (juM) 


&cat (S 0 


k c JK m (M~ l s- 


l ) 


SBT* 


120 


20 


1.7 x 10 5 




Sbtl40 


7 


3.5 


5.0 x 10 5 




Sbtl84 


30 


4.7 


1.6 x 10 5 






Substrate: 


sAAF-AMC 






subtilisin 




fccatts" 1 ) 


k c JK m (M~'s- 


l ) 


SBT* 


360 


6.4 


0.16 x 10 5 




Sbtl40 


63 


26 


4.1 x 10 5 






Substrate: sDVRAF-AMC 






subtilisin 


K m (juM) 


^(s" 1 ) 


k c JK m (M~ l s- 




SBT* 


33 


68 


2.1 x 10 6 




Sbtl40 


2.5 


31 


1.2 x 10 7 




Sbtl84 


11.4 


43 


3.8 x 10 6 





a Steady-state kinetic parameters were measured in 100 mM NaCl 
and 100 mM Tris-HCl at pH 8.3 and 25 °C. SBT* is a commercial 
version of sibtilisin BPN / containing the Y217L mutation. 



246 simultaneously mutagenizing all 12 sites in a single Sbt88 

247 mutant library is not prudent because the size of the library 

248 (~4 x 10 15 ) is far too large to be screened effectively. 

249 Analysis of Kinetic Properties versus sAAPF-pNA. The 

250 steady-state catalytic properties of the last four mutants in 

251 the series (Sbtl40, Sbtl51, Sbtl53, and Sbtl84) versus 

252 sAAPF-pNA are summarized in Table 3. Catalytic behavior 

253 is referenced against the Y217L variant of subtilisin BPN'. 

254 There are two reasons for this. First, it was found that 

255 interactions between the fluorogenic leaving group AMC and 

256 tyrosine 217 have large effects on the acylation rates 

257 observed for AMC substrates. Hence, for comparisons 



Scheme 1 

ki k 2 k 3 

E + S ^ ES — > EP E + P 

k_i k- 3 

between substrates with pNA and AMC leaving groups, the 258 

Y217L mutant is a better reference enzyme. Second, the 259 

Y217L mutant is a commercial version of subtilisin BPN' 260 

and is regarded as a better general protease (37). We found 261 

it to be more active than the wild-type enzyme against the 262 

range of substrates examined here. The Y217L mutant of 263 

subtilisin BPN' will be denoted as SBT*. All stabilized 264 

variants, except Sbtl84, were found to be more active than 265 

SBT* versus sAAPF-pNA (Table 3). However, none dis- 266 

played more than 30% of the activity of SBT* against 1% 267 

casein. We investigated the reason for this behavior. 268 

Analysis of Kinetic Properties versus the Protein Substrate. 269 

To obtain a more detailed assessment of catalytic properties 270 

versus a complex protein, we used a F30H mutant of the 56 271 

amino acid IgG-binding domain of Streptococcal protein G 272 

as a substrate (38). The AGunfoiding of the mutant (denoted 273 

G B F30H) is -2 kcal/mol at 25 °C; therefore, a single cut by 274 

a protease will cause the protein to unfold completely. Its 275 

unfolding can be followed by a 3 -fold increase in flu ores- 276 

cence of its single tryptophan (39); thus, we were able to 277 

measure the kinetics of initial cleavage events. Table 3 shows 278 

steady-state kinetic parameters for SBT* and calcium-free 279 

mutants Sbtl40 and Sbtl84. 280 

In terms of catalytic efficiency (k c JK m ), Sbtl40 is better 28 1 

than SBT*. However, when the substrate concentration 282 

exceeds 15 fiM (0.1 mg/mL), SBT* is better because its k CSLt 283 

is almost 6 times higher than Sbtl40. In fact, the peroxide- 284 

stable Sbtl84 also has a higher k cat than Sbtl40. To 285 

understand this behavior, we employed transient-state kinetic 286 

methods to determine microscopic rate constants. A general 287 

kinetic scheme for subtilisin catalysis is as follows (Scheme 288 
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Figure 2: Pre- steady- state kinetics versus the G B F30H substrate. 
Disappearance of the intact substrate as a ratio of [product]/ 
[enzyme] is plotted versus time after mixing with SBT* (•) and 
Sbtl40 (O). 

289 1) where E is the enzyme, S is the substrate, and P is the 

290 product. In this kinetic scheme, k\ is the rate of substrate 

291 binding, k-\ is the rate of substrate dissociation, k 2 is the 

292 rate of the acylation, and k 3 is a composite rate constant 

293 including both deacylation and release of the N- terminal 

294 product. We were not able to detect an acyl intermediate in 

295 any of the subsequent quench-flow experiments with either 

296 G B F30H or synthetic peptide substrates, and hence, we 

297 believe that the breakdown of the acyl intermediate is fast 

298 relative to the release of the peptide product. However, this 

299 has not been rigorously established. 

300 Pre-Steady-State Kinetics. To find out why k cat for SBT* 

301 is larger than k cat for Sbtl40, we measured pre- steady- state 

302 burst kinetics for both to determine the microscopic rate 

303 constants k 2 and k 3 . In these experiments, subtilisin Sbtl40 

304 or SBT* is mixed with a saturating concentration (500 fiM) 

305 of GbF30H in a preparative quench-flow apparatus. The 

306 reaction is quenched after variable reaction times by mixing 

307 with H3PO4 to shift the pH to 2.0. Each reaction time point 

308 is run over an analytical FPLC column so that the disap- 

309 pearance of intact G B F30H can be measured as a function 

310 of the reaction time. 

311 With Sbtl40, the time dependence for formation of cleaved 

312 product is described by a rapid exponential phase fol- 

313 lowed by a linear phase (Figure 2): [product]/[Sbtl40] = 

314 A 0 (l — e" A 0 + k cat t, where rates are defined by X = k 2 + k 3 

315 ~ 100 s -1 and k cat = k 2 k 3 /(k 2 + y = 4 s" 1 . The amplitude 

316 of the burst phase (A G ) = [k 2 /(k 2 + k 3 )] 2 = 0.9. Solving these 

317 equations yields k 2 ~ 100 s -1 and k 3 ~ 4 s -1 . 

318 With SBT*, no pre-steady-state burst is observable. The 

319 time dependence for formation of the cleaved product is 

320 linear: [product]/[SBT*] = k cat t, where k cat ~ k 2 = 20 s _1 . 

321 These results indicate that the acylation in the reaction of 

322 Sbtl40 with G B F30H is rapid when compared to SBT* 

323 (k 2 ~ 100 s _1 versus 20 s -1 )- The k cat for Sbtl40, however, 

324 is determined by k 3 (~4 s -1 ). 

325 Analysis of Kinetic Properties versus Succinyl-L-Asp-L- 

326 Val-L-Arg-L-Ala-L-Phe-7-amino-4-methylcoumarin (sDVRAF- 

327 AMC) and Succinyl-Ala-L-Ala-L-Phe-7 - amino -4-methyl- 

328 coumarin (sAAF-AMC). To further explore this phenomemon, 



T3 

> 

CD 
0) 
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Figure 3: Single turnover kinetics of Sbtl40 versus the sDVRAF- 
AMC substrate. Sbtl40 subtilisin at [E] = 0.48, 1.05, 2.35, 5.05, 
9.85, and 18.3 juM was mixed with 50 pM sDVRAF-AMC. The 
release of AMC is followed by fluorescence for each Sbtl40 
concentration. Solid lines are fits to Scheme 1 obtained with 
KinTekSim (19, 20). 

Table 4: Microscopic Rate Constants versus sDVRAF-AMC* 

k\ k-i k 2 k 3 K s K m k C3t K F 

(M" 1 s" 1 ) (s" 1 ) (s" 1 ) (s" 1 ) (juM) QiM) (s" 1 ) (t/M) 



Sbtl40 1.6 x 10 7 15 



65 



60 0.94 2.5 



31 



a Transient-state kinetic parameters were measured in 100 mM NaCl 
and 100 mM Tris-HCl at pH 8.3 and 25 °C. Values for microscopic 
rate constants were determined by fitting the single turnover data in 
Figure 3. 

we performed more detailed experiments using the defined, 329 

synthetic peptides: sDVRAF-AMC, which is a preferred 330 

subtilisin substrate, and sAAF-AMC, which is a poorer 331 

substrate. Steady-state parameters are summarized in Table 332 

3. Sbtl40 is a better enzyme versus sAAF-AMC than SBT* 333 

both in terms of k cat and k cat /K m . The steady-state results with 334 

sDVRAF-AMC are qualitatively similar to the results with 335 

the G B F30H substrate. That is, k cat /K m of Sbtl40 is greater 336 

than SBT*, but the k cat of Sbtl40 is much lower. We also 337 

observed a burst phase in the reaction of Sbtl40 with 338 

sDVRAF-AMC, indicating that acylation is not the rate- 339 

determining step. No burst phase was seen with sAAF-AMC. 340 

We performed a series of single turnover experiments as 341 

a function of [E] to determine microscopic rate constants 342 

for a single pass through a catalytic cycle. [E] was varied 343 

from 0.5 to 18 fiM with sDVRAF-AMC fixed at 50 pM. 344 

The data for Sbtl40 were fit according to Scheme 1 using 345 

KinTekSIM (Figure 3). Steady-state, single turnover, and 346 

product inhibition data were used to determine the rate and 347 

equilibrium constants reported in Table 4. 348 

The absence of a burst phase in the reaction of SBT* with 349 

sDVRAF at pH 8.3 indicates that acylation is limiting, as is 350 

generally observed for the hydrolysis of amide substrates 351 

by subtilisins. However, Acylation is not the rate-determining 352 

step of the reaction of Sbtl40 with sDVRAF-AMC. At pH 353 

8.3, k 2 and k 3 are roughly equal, which causes k cat to be half 354 

of the acylation rate. 355 

Burst-phase kinetics with amide substrates generally have 356 

not been reported with subtilisins because of a focus on 357 
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358 steady-state analysis and because the acylation step is rate- 

359 limiting for the commonly used peptide substrates such as 

360 sAAPF-pNA. No burst phase is observed when Sbtl40 is 

361 reacted with sAAPF-pNA. We estimate that the k 3 for sAAPF 

362 is ~10 3 s" 1 . Because k 3 greatly exceeds the rate of acylation, 

363 it does not influence the overall kinetics. Burst kinetics have 

364 been observed for the yeast-processing subtilisin, Kex2, 

365 against cognate sequences (40, 41). 

366 DISCUSSION 

367 Substrate Binding and A 1 -Terminal Product Release. Deg- 

368 radative subtilisins have evolved to be efficient catalysts 

369 against a broad range of substrate sequences. Subtilisin can 

370 bind to a broad range of sequences because many of its 

371 interactions are with the peptide backbone of the substrate. 

372 Most contacts are with the first four substrate residues on 

373 the acyl side of the scissile bond. These are denoted 

374 PI— P4, with numbering from the scissile bond toward the 

375 N terminus of the substrate (42—44). The backbone of the 

376 substrate inserts between strands 100—104 and 125—129 of 

377 subtilisin to become the central strand in an antiparallel 

378 /3-sheet arrangement involving seven main-chain hydrogen 

379 bonds (8, 43). Hence, a major component of substrate-binding 

380 energy involves only the peptide backbone and not side- 

381 chain interactions. The side-chain components of substrate 

382 binding to subtilisin result primarily from the PI and P4 

383 amino acids (5, 24, 45). Optimal substrates for subtilisin have 

384 large hydrophobic amino acids at these positions (24, 45), 

385 but even a substrate with AAAA at PI— P4 would be 

386 expected to bind with a ^ s of < 10 fiM (24, 26, 46, 47). 

387 It frequently is viewed as axiomatic for subtilisins in which 

388 £ ca t ~ k 2 (24). For the stabilized subtilisin Sbtl40, this was 

389 true for the substrates, sAAF-AMC and sAAPF— PNA, but 

390 was not true for the preferred substrate sDVRAF-AMC or 

391 for the complex protein substrate GbF30H. In both of these 

392 cases, tight substrate binding appears be correlated with slow 

393 product release. The effects of slow product release on 

394 steady-state kinetics can be discerned by examining the 

395 steady-state rate according to Scheme 1 . If the rate of product 

396 release is much faster than acylation then 

397 and 

K m ~ (k 2 + k^/k, 

398 However, if k 3 is in the same range as k 2 , then both k CSLt 

399 and K m will be decreased by a factor of k 3 /(k 2 + £3). Slow 

400 product release is not manifest in the ratio of k c JK m , which 

401 frequently is used as a measure of catalytic efficiency. By 

402 this measure, Sbtl40 subtilisin is a more efficient enzyme 

403 than SBT* across a range of substrates: sAAPF-pNA, sAAF- 

404 AMC, sDVRAF-AMC, and G B F30H. However, the value 

405 of fc C at for Sbtl40 subtilisin for a "good" peptide substrate 

406 such as sDVRAF-AMC or a complex protein such as 

407 GbF30H is much worse than SBT*. The reasons for this 

408 behavior can be seen by examining the microscopic rate 

409 constants for the hydrolysis of sDVRAF-AMC. 

410 The ratio k c JK m is the apparent second-order rate constant 

411 for substrate binding. It is less than the true binding rate 

412 (ki) by a factor of + fc-i) (48). For the reaction of 
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Sbtl40 with sDVRAF-AMC, k c JK m ~ h because the 413 

substrate off rate is small (k-\ = 15 s -1 ), compared to the 414 

acylation rate (fe = 65 s -1 ). As the substrate off rate becomes 415 

less than the acylation rate, its influence on the observed 416 

rate of product formation diminishes because of tight kinetic 417 

coupling between substrate binding and acylation. Further, 418 

because the slow substrate off rate is correlated to the slow 419 

release of the N- terminal product, k cat is attenuated. Thus, 420 

substrate affinity above a certain threshold is detrimental to 421 

catalysis, even though it has no effect on k c JK m . 422 

Tight-binding peptides, such as sDVRAF-AMC, appear 423 

to be better indicators of activity against a complex protein 424 

substrate than an average peptide such as sAAPF-pNA. The 425 

reaction of Sbtl40 with GbF30H is characterized by a rapid 426 

burst phase followed by a much slower steady-state phase. 427 

As with sDVRAF-AMC, Sbtl40 appears to bind tightly to 428 

preferred sequences in GbF30H and to release N- terminal 429 

products slowly. Interestingly, this effect can be mitigated 430 

by the mutation M222Q. This mutation decreases the 431 

acylation rate by a factor of 1 .6 but increases k CSLt because of 432 

a concomitant increase in £3. This mutation is of interest 433 

because it makes the enzyme stable in hydrogen peroxide 434 

because of the removal of the oxidation- sensitive methionine. 435 

(30). 436 

Stability and Catalytic Activity. The insertion of the 437 

substrate between subtilisin strands 100—104 and 125—129 438 

to become the central strand in an antiparallel /3- sheet 439 

arrangement suggests that increased subtilisin stability might 440 

result in tighter substrate binding. However, this does not 441 

appear to be the case. The kinetic behavior observed with 442 

Sbtl40 subtilisin does not appear to be a function of stability 443 

but rather to be dependent on specific mutations, which 444 

directly alter substrate and product interactions. The mutation 445 

that has the largest effect on substrate binding is E156S. This 446 

mutation has a minor effect on stability but amplifies the 447 

natural preference of subtilisin for large hydrophobic residues 448 

at the PI site by 2-5-fold (26). The additive effects of 449 

E156S, G169A, and N218S increase substrate affinity by 450 

^15-fold and together create an enzyme whose k CSLt is 451 

attenuated by product release in the presence of high- affinity 452 

substrates. The effect is mitigated to some extent by Y217L, 453 

which increases the acylation rate by ~5-fold and decreases 454 

K s by -5-fold. Y217L effects on /c ca t and K m depend on k— \ 455 

and £3 for a particular substrate. The kinetic properties of 456 

Sbtl40 appear to be roughly similar to the S156, A169, and 457 

L217 mutants of BPN' described by Wells et al. (26). Natural 458 

evolution appears to compromise between substrate binding 459 

versus acylation rate. In most Bacillus subtilisins, Y217 and 460 

E156 occur together or L218 and S156 occur together (49). 461 

Natural subtilisins may require a great deal of versatility to 462 

deal with folded and unfolded substrates over a range of 463 

concentrations. Preservation of broad proteolytic activity 464 

requires that N- terminal off rates do not attenuate k CSLt . It is 465 

possible, however, to tailor the catalytic profile to create a 466 

more specialized protease. For example, Sbtl40 is a very 467 

good protease in situations in which tight substrate affinity 468 

is important (e.g., substrates at low concentration, of non- 469 

optimal sequence, or in folded conformation). Tailoring of 470 

the catalytic profile appears to be compatible with high 471 

protease stability. We recently have been able to extend this 472 

work to evolve subtilisins with narrow substrate preferences 473 

to create useful processing activities in subtilisin (50). 474 
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/. Introduction 

Almost one-third of all proteases can be classified 
as serine proteases, named for the nucleophilic Ser 
residue at the active site. This mechanistic class was 
originally distinguished by the presence of the Asp— 
His-Ser "charge relay" system or "catalytic triad". 1 
The Asp-His-Ser triad can be found in at least four 
different structural contexts, indicating that this 
catalytic machinery has evolved on at least four sep- 
arate occasions. 2 These four clans of serine proteases 
are typified by chymotrypsin, subtilisin, carboxypep- 
tidase Y, and Clp protease (MEROPS nomenclature; 3 
Table 1). More recently, serine proteases with novel 
catalytic triads and dyads have been discovered, 
including Ser-His-Glu, Ser-Lys/His, His-Ser-His, 
and N-terminal Ser. 2 Several of these novel serine 
proteases are subjects of accompanying articles in 
this issue. 

This article will review recent work on the mech- 
anism and specificity of chymotrypsin-like enzymes, 
with the occasional references to pertinent experi- 
ments with subtilisin. Chymotrypsin-like proteases 
are the most abundant in nature, with over 240 
proteases recognized in the MEROPS database. 3 
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Table 1. Serine Protease Clans a 



clan 


members 


example 


catalytic residues 


distribution 






"Classic" Catalytic Triad Serine Proteases 




PA(S) 


301 


chy mo try ps in 


His - Asp - Ser 


B, Ar, F, PI, An, V 


SB 


91 


subtilisin 


Asp - His - Ser 


B, Ar, Pr, F,Pl,An,V 


SC 


64 


ra rhnwnpnt i Hasp Y 

V>GL1 UUA Y UCULlUCloC A 


Spr — A cri — l-fic 


B, Ar, Pr, F, PI, An, 


SK 


14 


Clp protease 


Ser-His-Asp 


B, Ar, Pr, F, Pl,An,V 






"Novel" Serine Proteases 






SE 


16 


D-Ala-D-Ala carboxypeptidase A 


Ser-Lys 


B, Ar, PI, An 


SF 


24 


signal peptidase I 


Ser-Lys/His 


B, Ar, F, PI, An, V 


SH 


7 


cytomegalovirus assemblin 


His-Ser-His 


V 


SM 


7 


C-terminal processing protease- 1 


Ser-Lys 


B, Ar, PI 


SN 


4 


dipeptidase E 


Ser-His-Glu 


B, An 


PB(S) 


4 


penicillin amidohydrolase precursor 


N-terminal Ser 


B, Ar, Pr 



a The MEROPS database classifies proteases based on structural similarity [Rawlings, N. D., and Barrett, A. J. (2000) MEROPS: 
The Peptidase Database. Nucleic Acids Res. 28, 323 - 325]. The following information is tabulated from MEROPS release 5.7 
[http://merops.iapc.bbsrc.ac.uk/]. Catalytic residues are listed in the order that they appear in the primary sequence. B, bacteria; 
Ar, archaea; Az, archezoa; Pr, protozoa; F, fungi; PI, plants; An, animals; V, viruses. In addition to the proteases listed below, 21 
serine proteases have been identified that are not yet assigned to a clan. 



These proteases can be found in eukaryotes, prokary- 
otes, archae, and viruses. Chymotrypsin-like pro- 
teases are involved in many critical physiological 
processes, including digestion, hemostasis, apoptosis, 
signal transduction, reproduction, and the immune 
response (Table 2). 4-8 "Cascades" of sequential acti- 
vation of serine proteases drive blood coagulation, 
complement fixation, and fibrinolysis. 9 " 11 Similar 
protease cascades appear to be involved in develop- 
ment, matrix remodeling, differentiation, and wound 
healing. 12-14 These diverse physiological niches de- 
mand proteases with wildly varied specificities, rang- 
ing from digestive proteases that cleave after hydro- 
phobic or positively charged residues, to proteases 
that recognize a five-residue cleavage site or even a 
single protein. Serine protease specificity can usually 
be rationalized by the topology of the substrate 
binding sites adjacent to the catalytic triad (the 
"active site cleft"). Several excellent reviews have 
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Table 2. Representative Mammalian 
Chymotrypsin-Like Proteases 



digestive proteases 
chymotrypsin 
trypsin 

pancreatic elastase 
immune response 
tryptase 

complement factor D 

complement factor B 

complement factor C 

complement component 2 

mast cell protease 

cathepsin G 

neutrophil elastase 
blood coagulation 

coagulation factor Vila 

coagulation factor IXa 

coagulation factor Xa 

coagulation factor Xlla 

thrombin 

protein C 
fibrinolysis 

urokinase 

tissue plasminogen activator 

plasmin 

kallikrein 
reproduction 

prostate specific antigen 

acrosein 
bacterial homologues 

alpha lytic protease 

Streptomyces griseus protease A 

Streptomyces griseus protease B 

recently reported on this topic. 15 " 17 However, while 
this perspective implies that specificity can be re- 
duced to discrimination in substrate binding, speci- 
ficity is primarily expressed in the rates of chemical 
transformation. This review will discuss the expres- 
sion of specificity during catalysis and strategies for 
substrate discrimination, focusing on the chymo- 
trypsin-trypsin-elastase paradigm and including 
examples from Factor D, alpha lytic protease, and 
thrombin. 

//. The Structure of Chymotrypsin-Like Serine 
Proteases 

First we will consider the structure of chymo- 
trypsin-like proteases (Figure 1). Chymotrypsin has 
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Figure 1. Structure of chymotrypsin. This figure was 
produced with Molscript v2.1 (ref 267) and POV-Ray by 
Annette Pasternak. 

245 residues arranged in two six-stranded beta 
barrels. 18 The active site cleft is located between the 
two barrels. This structure is generally divided into 
catalytic, substrate recognition and zymogen activa- 
tion domain components that are common to all 
chymotrypsin-like serine proteases. (The "loop 1" 
nomenclature used in some previous papers and 
reviews will be replaced with a system based on 
chymotrypsinogen numbering to facilitate compari- 
son among different proteases.) This organization is 
derived from Kraut 19 and is convenient for discussion, 
but wrongly implies that catalysis, substrate recogni- 
tion, and zymogen activation are separate processes. 
As will be seen, these three processes involve many 
of the same structural features and are intricately 



intertwined. In addition to the protease domain, 
many serine proteases are modular proteins contain- 
ing auxiliary regulatory domains (e.g., kringle, EGF, 
etc.). 20 The functions of these auxiliary domains will 
not be considered here. 

A. The Catalytic Components 

7. The Catalytic Triad 

The catalytic triad spans the active site cleft, with 
Serl95 on one side and Aspl02 and His57 on the 
other (Figure 1). His57 assumes the less favorable 
N(51-H tautomer (Figure 2), indicating that the 
tautomer equilibrium is perturbed at least lOOx from 
the solution position. 21 The catalytic triad is part of 
an extensive hydrogen bonding network (Figure 2A). 
Hydrogen bonds are generally observed between the 
Ndl-H of His57 and 0(5 1 of Asp 102 and between the 
OH of Serl95 and the Ne2-H of His57, although the 
latter hydrogen bond is lost when His57 is protonated 
(Figure 2). Similar hydrogen bonds are observed in 
protein inhibitor (eglin C), acylenzyme, and transition 
state analogue (trifluoroketone) complexes, which 
suggests that these hydrogen bonds are present 
thoughout the catalytic cycle. Interestingly, the 
His57— Asp 102 hydrogen bond has the syn orienta- 
tion relative to the carboxylate, so that the hydrogen 
bond forms with the more basic electron pair. 22,23 The 
OH of Ser214 forms a hydrogen bond with 0(31 of 
Asp 102 in almost all chymotrypsin-like proteases. 
Ser2 14 was once considered the fourth member of the 
catalytic triad, although more recent evidence indi- 
cates that it is dispensable. 24 25 Hydrogen bonds are 
also observed between the 0(32 of Asp 102 and the 
main chain NHs of Ala56 and His57. These hydrogen 
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Figure 2. Hydrogen bonding in the catalytic triad. (A) The catalytic triad in chymotrypsin complexes. The hydrogen 
bonding network of the catalytic triad is depicted for the complex with a protein inhibitor eglin C, an acylenzyme 
intermediate and a trifluoroketone transition state analogue. The dotted lines represent potential hydrogen bonds and 
the numbers are the distances between heteroatoms in A. (B) The effect of the C61-H hydrogen bond on charge distribution 
in His57. In the absence of the hydrogen bond, the positive charge will accumulate at N<51 where it can be stabilized by 
Aspl02. 
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bonds are believed to orient Asp 102 and His57. In 
addition, a novel hydrogen bond is observed between 
the Ccl-H of His57 and the main chain carbonyl of 
Ser214. 26 ' 27 Similar hydrogen bonds are also observed 
in subtilisin and other classes of serine hydrolases, 
suggesting that this interaction may be important for 
catalytic triad function. 26 This hydrogen bond may 
prevent the localization of positive charge on Nd 1 of 
His57 (Figure 2B). Interestingly, the carbonyl oxygen 
of Ser214 is also part of the polypeptide binding site 
(see below), and may mediate communication be- 
tween substrate and the catalytic triad. 

Z The Oxyanion Hole 

The oxyanion hole is formed by the backbone NHs 
of Glyl93 and Serl95. These atoms form a pocket of 
positive charge that activates the carbonyl of the 
scissile peptide bond and stabilizes the negatively 
charged oxyanion of the tetrahedral intermediate (see 
below). The oxyanion hole is structurally linked to 
the catalytic triad and the He 16— Asp 194 salt bridge 
via Serl95. Subtilisin also contains an oxyanion hole, 
formed by the side chains of Asnl55, Thr220, and the 
backbone NH of Ser221. 28 

B. The Substrate Recognition Sites 

The substrate recognition sites include the polypep- 
tide binding site and the binding pockets for the side 
chains of the peptide substrate. Investigations of 
protease specificity have generally focused on the 
Pl/Sl interaction (based on the nomenclature of 
Schechter and Berger (ref 29), where Pl-PT denotes 
peptide residues on the acyl and leaving group side 
of the scissile bond, respectively. The adjacent peptide 
residues are numbered outward, and SI, ST, etc. 
denote the corresponding enzyme binding sites), 
followed by consideration of the P2-Pn/S2-Sn in- 
teractions. The ST— Sn' sites are generally overlooked 
due to the prevalence of assays using substrates with 
chromophoric leaving groups. We will consider the 
SI site, the polypeptide binding site, the S2-Sn and 
Sl'-Sn' sites separately. 

I The SI Site 

Specificity of chymotrypsin-like serine proteases is 
usually categorized in terms of the PI— SI interac- 
tion. The SI site is a pocket adjacent to Serl95, 
formed by residues 189-192, 214-216, and 224-228. 
Specificity is usually determined by the residues at 
positions 189, 216, and 226 (reviewed in refs 15 and 
17). For example, the specificity of chymotrypsin 
correlates with the hydrophobicity of the PI residue, 
with Pl-Phe preferred over Ala by 50000. 30 - 31 The 
combination of Serl89, Gly216, and Gly226 create a 
deep hydrophobic pocket in chymotrypsin that ac- 
counts for this specificity. 18 Aspl89, Gly216, and 
Gly226 create a negatively charged SI site that 
accounts for trypsin's specificity for substrates con- 
taining Arg or Lys at Pi. 32 33 Elastase prefers sub- 
strates with small aliphatic residues at PI; the SI 
site of elastase is smaller than the SI sites of 
chymotrypsin and trypsin due to the presence of 
Val216 and Thr226. 34 The specificity of granzyme B 



for acidic PI residues is attributed to Arg226. 35 These 
observations suggest that a small set of structural 
elements controls the specificity of these proteases. 
However, as detailed below, simple transposition of 
these structural elements does not switch SI specific- 
ity. 

2. The Polypeptide Binding Site 

Protease— substrate interactions extend beyond the 
S 1 site, minimally including the polypeptide binding 
site and often involving additional binding pockets. 
The polypeptide binding site refers to the main chain 
of residues 214—216 which form an antiparallel beta 
sheet with the backbone of the P1-P3 residues of a 
peptide substrate (Figure 3). In chymotrypsin, hy- 
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Figure 3. Subsite binding of porcine pancreatic and 
human leukocyte elastase. The hydrogen bonding interac- 
tion of the polypeptide binding site and S' sites are shown 
by dotted lines. The residues that form each subsite are 
listed. Modified from ref 53. 

drogen bonds form between the carbonyl oxygen of 
Ser214 and the NH of the PI residue, the NH of 
Trp215 and the carbonyl of P3 and the carbonyl of 
Gly216 and the NH of P3. These interactions are a 
general feature of chymotrypsin-like proteases and 
are critical for efficient substrate hydrolysis. Inter- 
estingly, Gly216 has different conformations in chy- 
motrypsin, trypsin, and elastase, 16 which suggests 
that the strength of this hydrogen bond will vary. 
Note that residues 214—216 also form one wall of the 
SI site, and that the carbonyl of Ser214 forms a 
hydrogen bond to His57. These structural interac- 
tions form a line of communication between the 
polypeptide binding site, the SI site, and the catalytic 
triad. 

Interestingly, the inhibitory loops of protein inhibi- 
tors of serine proteases are locked in the extended 
conformation required to form the antiparallel beta 
sheet with the polypeptide binding site. This ex- 
tended conformation, termed the "canonical confor- 
mation", is observed in numerous protease-inhibitor 
complexes, including transition state analogues and 
peptidyl acylenzymes. 36-45 The canonical conforma- 
tion includes the P3— P3' residues and can extend to 
the amide NH of residue 218 and the carbonyl oxygen 
of P5 as observed in elastase (Figure 3). Importantly, 
the beta sheet structure causes the side chains the 
peptide substate to point in alternating directions. 

The relevance of the antiparallel beta sheet inter- 
action to peptide hydrolysis has recently been called 
into question because disruption of these hydrogen 
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bonding interactions had little effect on substrate 
hydrolysis. 46 However, the values of K& for protein 
inhibitors and transition state analogues correlate 
with k cat /K m for hydrolysis of the analogous sub- 
strates, which argues strongly that substrates also 
assume the canonical conformation. 38 ' 47-49 

3. The S2-Sn Sites 

The S2-S3 sites of chymotrypsin display little 
substrate discrimination, in keeping with chymo- 
trypsin's function as a digestive protease. 50 The S2 
site is a shallow hydrophobic groove bounded by 
His57, Trp215, and Leu99; the specificity of the S2 
site correlates with hydrophobicity of the P2 side 
chain. 49 The S3 site of chymotrypsin has little speci- 
ficity, and can even accommodate D-amino acids; the 
side chain of the P3 residue protrudes out of the 
active site cleft. 

Insertions in the loops surrounding the active site 
cleft create more defined pockets for the S2— Sn sites 
of other serine proteases. These sites can be the major 
determinant of specificity, 51 as exemplifed by elastase, 
where the P3/S3 interaction is dominant, 52 - 53 and 
enterokinase (also known as enteropeptidase), which 
recognizes the P5-P1 sequence Asp-Asp-Asp- 
Asp-Lys with restriction enzyme-like precision. 54 

4. The SV-S2 Sites 

The interactions of the leaving group with the S V— 
Sn' sites are usually inferred from protein inhibitor 
structures that do not contain the optimum PI'— Pn' 
residues. Therefore, the precise interactions of the 
P' side chains have not been defined. A hydrogen 
bond between the mainchain NH of the P2' residue 
and the mainchain carbonyl oxygen of Phe41 is a 
constant feature of the P7S' interactions. Chymo- 
trypsin prefers substrates with PI'— Arg or Lys, 
which is attributed to electrostatic interactions with 
Asp35 and Asp64. 55 The PT and P3' residues point 
in the same direction as a consequence of the beta 
sheet alignment of the substrate, so that the ST and 
S3" sites overlap. The S17S3' sites are bounded by 
His57, the 60's loop and the 40's loop. The P2' residue 
points in the opposite direction and may interact with 
the 150's loop. As above, insertions in these loops can 
create more defined ST- S3' binding sites. 51 

C. The Zymogen Activation Domain 

Chymotrypsin-like proteases are synthesized as in- 
active precursors ("zymogens") containing N-terminal 
extensions. Four segments are deformed in the zy- 
mogens of chymotrypsin and trypsin: the N-terminus 
to residue 19, residues 142-152, 184-193, and 216- 
223 (these regions are collectively termed the activa- 
tion domain 36 ). This deformed region includes the SI 
site and oxyanion hole, which explains the low 
activity of the zymogen. Proteolytic processing acti- 
vates the zymogen, releasing the N-terminal Ilel6. 
The new N-terminus forms a buried salt bridge with 
Asp 194, inducing a conformational change that or- 
ders the activation domain. The SI site and oxyanion 
hole are formed, creating the active protease. Impor- 



tantly, the loss of protease activity at high pH is 
attributed to the deprotonation of the N-terminus 
and disruption of Ilel6-Aspl94 salt bridge, shifting 
the conformational equilibrium toward an inactive 
zymogen-like conformation. 56 57 This mechanism of 
zymogen activation is conserved among mammalian 
chymotrypsin-like serine proteases. Bacterial homo- 
logues have a different mechanism of zymogen acti- 
vation, but retain a salt bridge involving Aspl94. 58 

///. The Mechanism of Serine Proteases 

Obviously, a detailed understanding of the enzy- 
matic reaction mechanism is required to understand 
how specificity is generated. Fortunately, chymo- 
trypsin-like proteases occupy a distinguished place 
in the annals of biochemistry as perhaps the best 
studied enzyme system. 

A. The Generally Accepted Chemical Mechanism 
of Serine Protease Catalysis 

All proteases must overcome three obstacles to 
hydrolyze a peptide bond: (a) amide bonds are very 
stable due to electron donation from the amide 
nitrogen to the carbonyl. For comparison, a simple 
alkyl ester is ~3000x more reactive than an amide 
bond, while a p-nitrophenyl ester is 300000 x more 
reactive. 59 Proteases usually activate an amide bond 
via the interaction of the carbonyl oxygen with a 
general acid, and may also distort the peptide bond 
to disrupt resonance stabilization; (b) water is a poor 
nucleophile; proteases always activate water, usually 
via a general base; and (c) amines are poor leaving 
groups; proteases protonate the amine prior to expul- 
sion. Serine proteases perform these tasks very 
efficiently: the rates of peptide hydrolysis by serine 
proteases are ~10 10 -fold greater than the uncatalyzed 
reactions. Obviously, these mechanisms of catalysis 
are not confined to peptide hydrolysis; serine pro- 
teases also readily hydrolyze other acyl compounds, 
including amides, anilides, esters, and thioesters. 

Figure 5 displays the generally accepted mecha- 
nism for chymotrypsin-like serine proteases. In the 
acylation half of the reaction, Serl95 attacks the 
carbonyl of the peptide substrate, assisted by His57 
acting as a general base, to yield a tetrahedral 
intermediate. The resulting His57-H + is stabilized by 
the hydrogen bond to Asp 102. The oxyanion of the 
tetrahedral intermediate is stabilized by interaction 
with the main chain NHs of the oxyanion hole. The 
tetrahedral intermediate collapses with expulsion of 
leaving group, assisted by His57-H + acting as a 
general acid, to yield the acylenzyme intermediate. 

B-H+ B-H+ 

6) o-\ o 

i „ R-C-NHR' ^ Jl +R'NH 2 

IT / X NHR' \>H 



Figure 4. Catalysis of peptide hydrolysis. The catalysis 
of peptide hydrolysis involves activation of the carbonyl via 
a general acid, activation of water with a general base, and 
protonation of the amine leaving group. 
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Figure 5. The generally accepted mechanism for serine proteases. 



The deacylation half of the reaction essentially 
repeats the above sequence: water attacks the acyl- 
enzyme, assisted by His57, yielding a second tetra- 
hedral intermediate. This intermediate collapses, 
expelling Serl95 and carboxylic acid product. The 
acylenzyme intermediate is well established as dis- 
cussed below, but the tetrahedral intermediates are 
inferred from solution chemistry. The transition 
states of the acylation and deacylation reactions will 
resemble the high energy tetrahedral intermediates, 
and the terms "transition state" and "tetrahedral 
intermediate" are often used indiscriminately in the 
literature. It is worth noting that a web of hydrogen 
bonding interactions links the substrate binding sites 
to the catalytic triad. As the reaction proceeds, 
changes in bonding and charge at the scissile bond 
will propagate to more remote enzyme— substrate 
interactions, and vice versa. The evidence supporting 
this mechanism has been reviewed extensively, and 
will be only briefly described here. 19 - 56 - 60 " 62 



7. The Discovery of the Catalytic Triad and the Oxyanion 
Hole 

Initial investigations of serine proteases focused on 
enzymes that could be readily obtained in large 
quantities: bovine chymotrypsin and trypsin and 
porcine elastase. Inactivators revealed the first clues 
into the catalytic machinery. Nerve gases such as 
diisopropylfluorophosphate (DFP) inactivate chymo- 
trypsin by specifically modifying Serl95 (Figure 
6) 63 " 65 and His57 is modified when chymotrypsin is 
inactivated by tosyl-L-phenylalanine chloromethyl 
ketone (TPCK). 66 The involvement of His57 was also 
suggested by the pH rate profiles which indicated 
that the deprotonation of a residue with pK a ~ 7 was 
important for activity. 56 Subsequent NMR experi- 
ments showed that the pAa of His57 is —7. 21 The third 
member of the catalytic triad, Asp 102, was discovered 
when Blow solved the crystal structure of chymo- 
trypsin. 67 Blow proposed that the hydrogen bonding 
network between Asp 102, His57, and Serl95— the 
charge relay system— activates Serl95 for nucleo- 
philic attack. A charge relay system was rapidly 
identified in trypsin, elastase, and other related 
enzymes, 32 - 33 - 68 and a similar catalytic triad of Asp32, 
His 64, and Ser221 was discovered in subtilisin. 69 70 
The oxyanion hole was first noted by Henderson, who 
proposed that the main chain NHs of Glyl93 and 
Serl95 stabilize the negatively charged oxyanion of 
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Figure 6, Mechanism inhibitors that form tetrahedral 
adducts. 

the tetrahedral intermediate. 71 Crystal structures of 
several transition state analogue complexes confirm 
this interaction. 33 ' 45 ' 72 " 74 

2. Mutagenesis of the Catalytic Triad 

Many experiments confirm the importance of the 
catalytic triad residues in serine protease catalysis. 
Chemical modification experiments were first used 
to disrupt the catalytic triad, either removing the 
hydroxy! of Serl95 to create anhydro proteases or 
methylating His57 at N62. 75 - 76 Either modification 
decreases protease activity by at least 10 4 -fold. 77 - 78 
Site-directed mutagenesis experiments provided a 
more incisive means to alter the catalytic triad. In 
trypsin, the substitution of either Serl95 or His57 
with Ala decreases the value of k cat IK m by 10 5 . 79 (Such 
values should most rigorously be considered upper 
limits for the activity of a given mutant enzyme. 
Water can substitute for a missing functional group, 
thus providing an alternative mechanism of catalysis 
and underestimating the effect of the mutation. In 
addition, factors such as translation misincorporation 



Serine Protease Mechanism and Specificity 
P2Tftr 




Figure 7. The P2 and PV residues bracket His57. The 
P2 and PT residues from the chymotrypsin- turkey ovo- 
mucoid third domain complex at 2.0 A, PDB accession 
number 1CHO. This figure was produced with Molscript 
v2.1 (ref 267) and POV-Ray by Lu Gan. 

and contaminating enzymes must also be considered 
when evaluating mutant enzymes with very low 
activities.) No further loss of activity is observed 
when the remaining catalytic triad residues are 
substituted. Thus, the substitution of either His57 
or Serl95 is sufficient to completely disable the 
catalytic triad. The substitution of Asp 102 to Asn in 
trypsin decreases k cat /K m by 10 4 at neutral pH. 80,81 
The analogous mutations of the catalytic triad have 
similar effects in subtilisin. 82 However, it is important 
to recognize that the catalytic triad is not the sole 
source of protease catalytic power: even with the 
catalytic triad disabled, these mutant proteases 
hydrolyze substrates at ~10 3 x the uncatalyzed 
rate. 79,82 This remaining activity reflects the contri- 
bution of the oxyanion hole, desolvation, and perhaps 
the canonical conformation to catalysis. 

Interestingly, the activity of His57 mutations can 
be rescued by substrates containing His. This sub- 
strate assisted catalysis was first demonstrated in 
subtilisin and has recently been extended to tryp- 
sin. 83-86 In the absence of His57, the presence of P2— 
His or PT-His increases peptide hydrolysis by as 
much as 300-fold. As noted above, His57 interacts 
with both the P2 and PT residues (Figure 7); it is 
believed that the P2- and Pl'-His can replace His57 
in the catalytic triad, forming similar interactions 
with Serl95 and Aspl02. 

A second instance of chemical rescue is found in 
mutations of Asp 102. Hydroxide ion increases the 
activity of Asp 1 02 Asn trypsin to within 1 0% of wild 
type at pH 10. 80 Similar hydroxide activation is also 
observed in subtilisin. 82 The mechanism of this 
hydroxide rescue is not understood. 

Unfortunately, since the oxyanion hole of chymo- 
trypsin-like enzymes is composed of mainchain at- 
oms, site directed mutagenesis cannot be used to 
confirm its role in catalysis. The oxyanion hole of 
subtilisin is composed of the side chains of Asn 155 
and Thr220 as well as the backbone NH of Ser221. 
Substitution of Asnl55 decreases k cat /K m by 100— 10 3 - 
fold, indicating that this residue provides 3—4 kcal/ 
mol of transition state stabilization energy. 87-90 The 
substitution of Thr220 with Ala decreases k^JKm by 
~20-fold (~2 kcal/mol) and the removal of both 
Asnl55 and Thr220 decreases activity by 10 4 -fold, 
suggesting that the oxyanion hole can stabilize the 
transition state by as much as 6 kcal/mol. However, 
no further loss of activity is observed when oxyanion 
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mutations are combined with mutations of the cata- 
lytic triad, indicating that the oxyanion and catalytic 
triad function cooperatively. Therefore, the 6 kcal/ 
mol of transition state stabilization is not simply the 
measure of the interactions at the oxyanion hole. 90 
The collaborative function of the catalytic machinery 
makes it impossible to precisely ascribe energies to 
individual interactions. Interestingly, Thr220 ap- 
pears to be too far away to hydrogen bond directly to 
a tetrahedral intermediate; either Thr220 moves 
during catalysis, or stabilization results from long- 
range electrostatic interactions. Molecular modeling 
experiments support both mechanisms. 88 91 

3. Evidence for the Acylenzyme Intermediate 

The famous burst experiment of Hartley and Kil- 
bey was the first evidence for the acylenzyme inter- 
mediate. 92 The hydrolysis of p-nitrophenyl acetate by 
chymotrypsin is biphasic, with one enzyme-equiva- 
lent of p-nitrophenol produced in the rapid phase, 
indicating that an enzyme— acetate intermediate 
formed and decomposition of this intermediate was 
rate-limiting. Acylenzyme intermediates were sub- 
sequently isolated in several reactions of esters with 
chymotrypsin, and the chemical/kinetic competence 
of these intermediates was demonstrated. 56 However, 
similar experiments failed to detect acylenzyme 
intermediates in the hydrolysis of amide substrates. 
This failure could be readily explained if the forma- 
tion of the acylenzyme was rate-limiting, so that the 
acylenzyme no longer accumulated to detectable 
levels during amide hydrolysis. The acylenzyme 
intermediate could be detected in acyltransfer experi- 
ments. 93 As shown in Figure 8, the acylenzyme 
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Figure 8. The mechanism of acyltransfer reactions of 
serine proteases. The acylenzyme normally reacts with 
water to form a carboxylic acid. However, other nucleo- 
philes can also react with the acyl enzyme. If the added 
nucleophile is the N-terminus of a peptide, the reaction is 
the reverse of the hydrolysis reaction and displays the same 
specificity. 

usually reacts with water, but can also react with 
other nucleophiles. The partitioning of acyltransfer 
between water and nucleophile will be constant for 
a given acylenzyme, regardless of how the acyl- 
enzyme was formed. Esters and amides have identi- 
cal partition ratios in acyltransfer reaction, indicating 
that their reactions involves a common acylenzyme. 
Note that if the nucleophile is a peptide, the acyl- 
transfer reaction is the reverse of peptide hydrolysis, 
and must exhibit the same specificity. These reac- 
tions have proven useful in mapping the specificity 
of the Suites. 94 " 99 

Several X-ray crystal structures of acylenzyme 
intermediates have been solved. The first structure 
of a peptidyl acylenzyme was a surprise revealed 
when high-resolution data uncovered a peptide in the 
active sites of y-chymotrypsin. 41100101 The peptide is 
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probably Pro— Gly— Ala— Tyr, but may be a mixture 
derived from autoproteolysis. At low pH, continuous 
density is observed between Serl95 and the carbonyl 
of the peptide, indicating that the acylenzyme is 
formed. Interestingly, the carbonyl group is not 
completely in the oxyanion hole: the carbonyl oxygen 
forms a hydrogen bond with Glyl93 (2.8 A), but not 
with Serl95 (3.5 A). At high pH, the covalent bond 
between Serl95 and the peptide is lost, indicating 
that the product complex has formed. The peptide 
can be readily replaced with inhibitors, further 
demonstrating that this acylenzyme is catalytically 
competent. A peptide was also discovered in the 
active site of Streptomyces griseus proteinase A 
(SGPA). 42 This peptide appears to be a mixture of 
autolysis products. As in y-chymotrypsin, the peptide 
appears to be an acylenzyme at low pH and a product 
complex at high pH. Unlike y-chymotrypsin, the 
carbonyl oxygen of the acylenzyme occupies the 
oxyanion hole. More recently, Wilmouth and col- 
leagues screened a peptide library to isolate a stable 
peptidyl acylenzyme of elastase. 102 /3-Casomorphin is 
a heptapeptide that forms a stable acylenzyme at low 
pH via the C-terminal carboxylate. The carbonyl 
oxygen is also found in the oxyanion hole in this 
acylenzyme. Although initial reports suggested that 
this carbonyl group was distorted toward a tetra- 
hedral structure, more recent high-resolution data 
indicate that the ester is planar. 102 The stability of 
these acylenzymes can be attributed to the protona- 
tion of His57 at low pH. All three of these peptidyl 
acylenzymes display the canonical conformation. 

4. Evidence for the Tetrahedral Intermediate 

As yet, the tetrahedral intermediate has not been 
rigorously demonstrated in the serine protease reac- 
tion, but is included on the basis of solution chem- 
istry. 103104 Tetrahedral intermediates are unstable 
and have very short lifetimes. 78 Further, the lifetime 
of the intermediate decreases as the leaving group 
improves. In favorable circumstances, such leaving 
group effects can provide evidence for tetrahedral 
intermediates: as the pK a of the leaving group 
decreases below that of the nucleophile, formation of 
the tetrahedral intermediate becomes rate-limiting 
and a characteristic break will be observed in the 
Bronsted plot. 105 In the case of the best leaving 
groups such as p-nitrophenol, the lifetime of the 
tetrahedral intermediate becomes less than a vibra- 
tion, the reaction becomes concerted and the tetra- 
hedral species must be considered a transition 
state. 106 " 108 As in solution, the hydrolysis of p- 
nitrophenylacetate by chymotrypsin is a concerted 
reaction, 109 and it is formally possible that enzyme- 
catalyzed peptide hydrolysis is also concerted. Of 
course, this would require an enormous activation of 
the amine leaving group. Unfortunately, the leaving 
group effects measured in Bronsted analysis are 
greatly reduced by general base catalysis, so that 
Bronsted analysis of chymotrypsin catalyzed reac- 
tions provides no information about the tetrahedral 
intermediate. 110111 14 N/ 15 N isotope effects have been 
similarly equivocal. 62 



Not surprisingly, the observation of the tetrahedral 
intermediate by X-ray crystallography has proven 
elusive. 112 The first report of a tetrahedral intermedi- 
ate in the trypsin-BPTI complex could not be sub- 
stantiated by NMR experiments; further refinement 
revealed that the scissile peptide bond is distorted 
toward tetrahedral, but no covalent bond is formed 
with Serl95. 113114 Interestingly, this bond is also 
distorted in the absence of trypsin. 115 Another tetra- 
hedral intermediate has recently been reported in 
crystals of elastase and the peptide /3-casomorphin. 102 
When crystals containing the peptidyl acylenzyme 
are transferred to pH 9.0 and flash-frozen, electron 
density is observed that suggests a tetrahedral ad- 
duct. However, this exciting result must be ap- 
proached with great caution. X-ray crystallography 
is a time averaged method, and this tetrahedral 
adduct may be a mixture of complexes. The presence 
of sulfate in the cryoprotectant solution is especially 
troubling because sulfate binds in the active site of 
many serine proteases and can easily be mistaken 
for a tetrahedral interemediate (see refs 116 and 117 
and references therein) . The kinetics of intermediate 
formation are also not consistent with a pseudo first 
order process (the tetrahedral intermediate is present 
at 1 min but completely decomposed after 2 min), 
which further suggests that the "tetrahedral inter- 
mediate" is actually a mixture of complexes. A more 
promising tetrahedral intermediate appears when 
crystals the peptidyl acylenzyme of y-chymotrypsin 
are soaked in hexane. 118 The electron density around 
the acyl carbonyl appears to be tetrahedral, and 
density is observed in the PI' site. These observations 
suggest that a peptide from the crystallization mix 
has attacked the acylenzyme, forming a stable tetra- 
hedral intermediate. The stability of this intermedi- 
ate is not understood, but might be explained by a 
shift in equilibrium and/or hydrogen bonding ener- 
gies in organic solvent. Unfortunately, tetrahedral 
intermediates have come and gone; the putative 
tetrahedral intermediates of both elastase and chy- 
motrypsin should be substantiated by additional 
methods. 

B. Mechanisms of Inactivation and Inhibition 

A comprehensive discussion of inhibitor/inactivator 
mechanisms can be found in the Powers paper in this 
issue. The following will be a brief discussion of 
compounds that have played an important role in 
unraveling the mechanism of serine protease cataly- 
sis and substrate specificity. 

1. Transition State Analogue Inhibitors 

Several classes of inhibitors form stable tetrahedral 
adducts that mimic the tetrahedral intermediate/ 
transition state of the serine protease reaction. The 
specificity of these reagents can be tailored to a given 
protease by combining the reactive groups with the 
appropriate peptidyl portions. 72119 DFP and the less 
toxic phenylmethanesulfonylfluoride (PMSF) inacti- 
vate virtually all serine proteases and are diagnostic 
for this protease class. These compounds are very 
electrophilic, and react irreversibly with Serl95 to 
form stable tetrahedral adducts (Figure 6). Chloro- 
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Figure 9. Mechanisms of inhibitors that form stable acyl- 
enzymes. 

methyl ketones (CMK) are potent inactivators of both 
serine and cysteine proteases, and therefore are not 
diagnostic. 120 Chloromethyl ketones cross-link serine 
proteases by forming a hemiketal with Serl95 and 
alkylating His57; 121 " 123 the reaction proceeds via an 
epoxide intermediate as shown in Figure 6. 124-127 
Interestingly, the formation of the epoxide intermedi- 
ate requires that the oxyanion can escape the oxy- 
anion hole. 

Peptidyl aldehydes, trifluoromethyl ketones, bo- 
ronic acids, and similar compounds form reversible 
tetrahedral adducts with Serl95. These complexes 
closely resemble the transition state/tetrahedral in- 
termediate of the hydrolysis reaction: His57 is pro- 
tonated and the hemiketal oxygen is negatively 
charged and bound in the oxyanion hole (Figure 6). 
The affinity of these inhibitors increases as they 
incorporate the features of good substrates; affinity 
correlates with the value of k cat /K m for the hydrolysis 
of the analogous substrate, as required for a transi- 
tion state analogue. 47 49128 Some aldehyde inhibitors 
also form an alternative adduct where the hemiacetal 
oxygen interacts with His57 rather than the oxyanion 
hole. 129-131 Because this adduct mimics the interac- 
tions of the leaving group, it can also be considered 
a transition state analogue. However, peptidyl bo- 
ronic acids can also form an adduct at Nc2 of His57 
that cannot be considered a transition state analog. 132 
These Type 2 adducts are formed when inhibitors 
resemble poor substrates or in complexes of protease 
zymogens, 133 forming higher affinity complexes than 
would be expected based on the hydrolysis of analo- 
gous substrates. 

2. Stable Acylenzymes 

Many compounds inactivate serine proteases by 
forming stable acylenzyme intermediates. This sta- 
bility derives from three general mechanisms: 

(a) The intrinsic reactivity of the acyl group. The 
reactivity of esters can be decreased by increasing 
the electron density of the carbonyl. For example, the 
deacylation rates of benzoyl chymotrypsins decrease 
as the substituent becomes more electron-donating. 134 
Similarly, alpha heteroatoms also deactivate esters, 
and acylchymotrypsins derived from azapeptides are 
stable. 135 Note that azapeptidyl acylenzymes are 
isosteric with peptidyl acylenzymes, and thus are 



expected to closely mimic the normal catalytic inter- 
mediates. 136 Of course, these effects also decrease the 
acylation rate constants, making it more difficult to 
form the acylenzyme. However, the effects on acyl- 
ation can be offset with good leaving groups such as 
p-nitrophenol. Note that such reagents as p-nitro- 
phenyl-guanidinobenzoate are active site titrants, 
reacting with trypsin to form a stable acylenzyme 
intermediate and one equivalent of p-nitrophenol 
(Figure 9). 137 Electron- withdrawing properties can 
also be disguised by more clever means and many 
classes of mechanism-based inactivators exploit this 
strategy (see Powers paper in this issue). 

(b) The acylenzyme does not interact with the 
oxyanion hole. This mechanism is illustrated by 
arylacryloyl chymotrypsin, where the planarity of the 
arylacryloyl system does not allow the carbonyl to 
enter the oxyanion hole. 71138-141 A similar mechanism 
may account for the stability of acetyl-chymotrypsin; 
this adduct lacks a large hydrophobic group that can 
bind in the SI site and direct the carbonyl oxygen 
into the oxyanion hole. 

(c) The catalytic triad is disrupted and cannot 
activate water. The simplest manifestation of this 
mechanism is the protonation of His57 so that it 
cannot serve as a general base catalyst as discussed 
above. A second example is the inactivation of chy- 
motrypsin by 3-benzyl-6-chloro-2-pyrone. 142 This com- 
pound forms a stable acylenzyme where Serl95 
attaches to CI of (2)-2-benzylpentenedioic acid. His57 
forms a salt bridge with the free carboxylate of the 
inactivator. 143 This salt bridge prevents His57 from 
performing its role as the general base in the de- 
acylation reaction. Nature has also used this strategy 
to inhibit serine proteases. Serpins form stable 
acylenzymes; this stability results from distortion of 
the active site upon complex formation 144 (see the 
Gettins paper in this issue). 

C. Issues in the Function of the Catalytic Triad 

1. Where Are the Protons? One-Proton versus 
Two-Proton-Transfer Mechanisms 

Although the general features of the charge relay 
system are widely accepted, the details of its function 
have been surprisingly controversial. 21 The issue of 
whether the proton in the Asp 102— His57 hydrogen 
bond resides on His57 or Asp 102 has been particu- 
larly contentious. Blow's original conception of the 
charge relay system explained the reactivity of 
Serl95 in terms of resonance forms Asp-C02 _ /H-His/ 
Ser-OH and Asp-CC^H/His-H/Ser-CT. 1 ' 67 This mech- 
anism was later refined to involve actual proton 
transfers from Serl95 to His57 and His57 to Aspl02, 
resulting in the tetrahedral intermediate and neutral 
Aspl02 and His57. 145 146 This two-proton-transfer 
mechanism was controversial from the outset because 
it requires that the pK a of His57 is lower than 
Asp 102, i.e., the pK a of His57 must decrease in the 
tetrahedral intermediate/transition state. Neverthe- 
less, this mechanism was incorporated into many 
textbooks. 

! H NMR experiments strongly favor the one-proton 
mechanism. The Nd 1 proton of His57 has an unusu- 
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ally low field ! H NMR signal (13.8-15 and 17-18 
ppm in neutral and protonated His57, respectively) 
which can be used to determine the pK a of His57 in 
various complexes. 147 148 The pK a of His57 is ~7 in 
the free enzyme, acylenzyme, and protein inhibitor 
complexes but increases to greater than 10 in transi- 
tion state analogue complexes. 128 149-151 Neutron dif- 
fraction experiments also demonstrated that the 
proton is associated with His57 in monoisopropyl 
phosphate modified trypsin. 152 Although these ex- 
periments do not establish the position of the proton 
in the transition state of the catalytic reaction, they 
argue strongly for the one-proton mechanism, where 
protonated His57 is stabilized by an electrostatic 
interaction with a negatively charged Asp 102. 

2. Low Barrier Hydrogen Bonds 

A more subtle version of the two-proton-transfer 
mechanism has emerged in the low barrier hydrogen 
bond (LBHB, also known as short, strong hydrogen 
bond) hypothesis of recent years. 153-159 This contro- 
versial theory developed from the observation that 
unusual hydrogen bonds form in the gas phase, 
characterized by short length (<2.6 A) and extra- 
ordinary strength (30 kcal/mol). 160161 The unusual 
properties of these hydrogen bonds is believed to 
derive from the short distance between donor and 
acceptor atoms with matched pK a 's, which eliminates 
the barrier for proton transfer so that the proton is 
shared in a broad single-potential well. The hydrogen 
bond between Asp 102 and His57 appears to fulfill 
many of the criteria for a LBHB: 162-164 (a) The 
distance between Aspl02 and His57 is <2.6 A. (b) 
The ! H NMR signal of the Ndl proton of His57 is 
unusually downfield. 21147 (c) A deuterium isotope 
effect is observed on chemical shift of the His57 Nd 1 
proton, (d) The D/H fractionation factor of the Ndl 
proton is low. 150 165 Unfortunately, these observations 
are not diagnostic for LBHBs, and the Asp-His 
hydrogen bond fails by other measures: (a) The pK a s 
of Asp 102 and His57 are not matched, (b) both 15 N-H 
spin couplings and 15 N chemical shifts indicate that 
the proton is -85% localized on His57. 158 166 (c) The 
His57— Ndl proton is not shielded from water and 
the environment is polar. 158 Moreover, both Asp 102 
and His57 are involved in multiple hydrogen bonding 
interactions; these interactions will compete with and 
weaken the Odl— Ndl— H hydrogen bond. Further, 
disruption of the catalytic triad decreases the cata- 
lytic power of serine proteases by no more than ~6 
kcal/mol, which must be distributed among several 
hydrogen bonds in addition to the Aspl02— His57 
interaction. This energy loss is consistent with a 
simple electrostatic interaction. 167 Last, mutations at 
Asp 102 can be rescued by replacing Ser214 with 
Asp. 168 Although this altered catalytic triad has not 
been thoroughly characterized, it would be surprising 
if a LBHB could be so easily reconstituted. Similarly, 
substitution of the catalytic triad Asp32 with Cys in 
subtilisin eliminates the putative LBHB, but de- 
creases kcat/K m by no more than 50-fold. 169 These 
experiments suggest that if a LBHB exists, it con- 
tributes no more than 2 kcal/mol to the catalytic 
power of serine proteases. 



The interactions of the oxyanion with the oxyanion 
hole have also been suggested to involve LBHBs. 153 
However, the two hydrogen bonds of the oxyanion 
hole provide a total of —6.5 kcal/mol in transition 
state stabilization, 87 ' 89 ' 90 149 170 which is consistent 
with simple electrostatic interactions. 171 

3. Moving His Mechanisms 

In the generally accepted mechanism for serine 
protease catalysis (Figure 5), His57 removes a proton 
from Serl95 and transfers it to the leaving group. 
This proton transfer has troubled some investigators, 
who argue that if His57 abstracts a proton from 
Serl95, then His57-H + will be positioned near Serl95 
in the tetrahedral intermediate, which would favor 
re-protonation of Serl95 and regeneration of sub- 
strate. 172173 For the reaction to proceed, His57-H + 
must be positioned to protonate the leaving group. 
The "His flip" mechanism of Figure 10 has been 
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Figure 10. The "His flip" mechanism. After initial forma- 
tion of the tetrahedral intermediate, the Ncl-H will be 
within hydrogen bonding distance of the yO of Serl95. 
Rotation of His57 supposedly places the INkl-H near the 
leaving group NH. Although the hydrogen bonds between 
Ndl-H and Aspl02 and Cel and Ser214 would be dis- 
rupted, new hydrogen bonds will form to replace them. 21 

proposed to solve this problem: 27 after formation of 
the tetrahedral intermediate, His57-H + flips, which 
should place the Ndl proton near the leaving group. 
The flipped conformation of His57 has been observed 
in subtilisin in 50% dimethylformamide. 174 In addi- 
tion, the presence of the Ser214— His57 hydrogen 
bond and the unusual dynamics of His57 have been 
taken as evidence for the "His flip" mechanism. 21 

Several considerations argue against the "His flip" 
mechanism. First, the disruption and reformation of 
so many hydrogen bonds in the short lifetime of the 
tetrahedral intermediate seems unlikely. The His flip 
mechanism appears to violate the principle of least 
motion, which suggests that enzymatic reactions 
occur with an economy of movement. 175 Moreover, 
His57 is bracketed by the P2 and PT residues of a 
peptide substrate, which creates a steric barrier to 
flipping (Figure 7). The unusual dynamics of His57 
can be explained by substrate and solvent moving in 
and out of the active site, or even the chemical 
transformation— there is no need to invoke flipping. 
Most seriously, while the flipping mechanism looks 
reasonable in two dimensions, it fails in three dimen- 
sions. Analysis of peptide trifluoroketone or boronic 
acid complexes shows that the flip actually places 
proton donor farther away from the leaving group by 
0.8 A. 

Of course, some movement of both protease and 
substrate is inevitable as the hydrolysis reaction 
proceeds. Serl95 must move at least 1 A to form the 
tetrahedral intermediate and smaller movements of 



Serine Protease Mechanism and Specificity 

the substrate will be required as the bonding changes 
from sp 2 to sp 3 . Therefore, His57 may not be oriented 
toward Serl95 in the tetrahedral intermediate as 
assumed. Even if His57 is positioned so that Serl95 
is protonated in the tetrahedral intermediate, the 
reaction is not doomed to collapse back to starting 
materials. The proton is less than 2.5 A from the 
amine leaving group, which has the higher pK a . 
Protonation of the amine is favored overall, and the 
proton needs only to transfer to the amine before the 
tetrahedral intermediate breaks down. In addition, 
several structures of transition state analogue com- 
plexes indicate that His57-H + can interact with the 
leaving group position while maintaining the Asp 102— 
His57 hydrogen bond (ref 74 and references therein). 
Whatever reorientation that may be necessary for 
completion of the catalytic cycle can probably be 
accomplished with small adjustments in both pro- 
tease and substrate and without breaking the 
Aspl02-His57 hydrogen bond. 

4. The Hydrolytic Water 

The mechanism of serine protease catalysis places 
some rather rigorous constraints on the hydrolytic 
water: it should approach acylenzyme from the 
leaving group side of the active site, it must make a 
hydrogen bond to His57 and the angle of attack must 
approximate 109 degrees. Several crystal structures 
of serine proteases contain ordered water molecules 
that appear to fulfill one or more of these criteria. 
Henderson noted the first such water in a cinnamoyl 
enzyme complex of chymotrypsin, and similarly 
located waters have appeared in subsequent acyl- 
enzyme structures. 4171 This water is 2.4 A above the 
carbonyl of the acylenzyme and also interacts with 
His57 as would be expected of the hydrolytic water, 
but the angle of attack on the carbonyl is not 
favorable. In addition, a similarly placed water 
molecule is observed in product and transition state 
analogue complexes. 176 The latter complexes mimic 
the tetrahedral intermediate, i.e., the intermediate 
formed after the water has attacked; therefore, this 
water is more likely to play a structural role. Another 
candidate water molecule is observed in a benzoyl- 
enzyme complex of trypsin, poised 3.9 A above the 
carbonyl and 3.4 A from His57. 177,178 However, this 
position is normally occupied the P2— P3 residues of 
a peptide substrate, which suggests that this water 
is not involved in the hydrolysis of peptidyl acyl- 
enzyme. A better candidate for the hydrolytic water 
has been identified in peptidyl acylenzymes of SGPA 
and elastase. 39,42 This water molecule is found ~3 A 
from His57 and ~3 A from the carbonyl of the 
acylenzyme, at an optimal angle for nucleophilic 
attack from the leaving group side of the active site. 
However, as promising as this candidate is, it must 
be remembered that these complexes are unusually 
stable acylenzymes at a pH where catalysis does not 
occur readily; the actual hydrolytic water may not 
present. 

5. Stereochemistry of the Tetrahedral Intermediate 

The stereoelectronic requirements of formation and 
breakdown of the tetrahedral intermediate pose an 
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additional conundrum. 62,173 When a nucleophile at- 
tacks an amide bond, the lone electron pairs of the 
oxyanion and the nitrogen of the leaving group must 
be antiperiplanar to the new bond (Figure 11). In a 
serine protease reaction, this stereochemistry leaves 
the lone electron pair of the amine leaving group 
pointing away from His57-H + . The nitrogen must 
undergo an inversion to position the lone pair for 
protonation. Although it is unclear how the protease 
will influence the inversion process, such inversions 
occur readily in solution. Perhaps the enzyme can use 
interactions with the oxyanion hole and S' sites to 
overcome the stereoelectronic imperative of solution 
chemistry. 62 




Figure 11. Stereoelectronic mechanism for serine protease 
catalysis. 

6. Stabilization of the Tetrahedral Intermediate/Strain in 
the Acylenzyme Intermediate 

Enzyme catalysis is most simply described in terms 
of transition state stabilization, but in practice 
interactions that stabilize the transition state will 
also strain the ground state. This idea can be 
demonstrated by considering the oxyanion hole, 
where interactions should both stabilize the tetra- 
hedral intermediate and activate the substrate for 
nucleophilic attack. 

Stabilization of tetrahedral intermediates is evi- 
dent in the characterization of transition state ana- 
logue complexes (Figure 6). The pK a of the hemiketal 
oxygen in chloromethyl ketone-inactivated chymo- 
trypsin is ^2 units lower than the pK a in solution, 170 
while the pK a of the hemiketal oxygen of trifluoro- 
ketone adduct is >5 units lower. 149 These results 
suggest that the oxyanion hole interactions stabilize 
the tetrahedral intermediate by ~6 kcal/mol. 

Unfortunately, distortion of the peptide substrate 
has been difficult to demonstrate. However, convinc- 
ing evidence for such distortion/activation of the 
acylenzyme comes from resonance Raman spectros- 
copy. Tonge and Carey measured the carbonyl stretch- 
ing frequencies of a series of arylacryloyl acylenzymes 
of chymotrypsin and subtilisin. 140,179-181 Recent work 
extends these observations to p-(dimethylamino)- 
benzoylenzymes. 182 Multiple features are observed in 
the carbonyl profile of these acylenzymes, suggesting 
that the carbonyl group has multiple conformations. 
Because hydrogen bonding decreases the stretching 
frequency of carbonyl bond, the lowest frequency 
band was assigned to the conformation of the acyl- 
enzyme with the carbonyl bound in the oxyanion hole. 
This assignment is substantiated by the following 
observations: (1) The intensity of this band increases 
with pH, displaying the same pH dependence as the 
deacylation rate constant. 140 (2) This band is absent 
in subtilisin variants where the oxyanion hole has 
been eliminated. 179 Importantly, the frequency of this 
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band (^c=o) correlates with deacylation rate for a 
series of acylenzymes. This correlation spans 4 orders 
of magnitude in deacylation rate constants (6.8 x 
10 -6 -0.3 s -1 ). 180 - 181 Further, v c =o can be related to 
carbonyl bond length, indicating that carbonyl bond 
length increases 0.025 A as the deacylation rate 
constant increases 16300-fold. This increase is ~ 1 1% 
of the change incurred upon transformation of a 
carbonyl group to a C— O bond. If this correlation is 
extrapolated to the deacylation rates observed for 
good substrates (~100 s -1 ), the carbonyl bond is 
distorted ~14% toward a C-0 bond. Last, model 
systems indicate that the change in AH associated 
with the shifts in vq=o is 6.4 kcal/mol, similar to the 
changes in transition state stabilization of 5.7 kcal/ 
mol over the range of compounds. There is sufficient 
energy in the hydrogen bonding interactions of the 
oxyanion hole to provide this transition state stabi- 
lization. Similarly, multiple ester bands are also 
observed in Fourier transform infrared spectroscopy 
of cinnamoyl-chymotrypsins, and have likewise been 
attributed to multiple conformations of the carbonyl 
group. 183 This work provides further evidence for 
distortion of the carbonyl bond in the acylenzyme. 
Interestingly, the ester carbonyl frequency correlates 
with dielectric constant in model compounds. This 
correlation suggests that the carbonyl of the acyl- 
enzyme experiences a dielectric constant of 70 in the 
oxyanion hole conformation and 40 in the nonproduc- 
tive conformation. 

It appears that the function of the catalytic triad 
and oxyanion hole can be explained by the electro- 
static complementarity of the active site and transi- 
tion state. 167 The preorganization of the enzyme 
active site provides a substantial advantage relative 
to the uncatalyzed reaction where solvent molecules 
must organize to stabilize the developing charges as 
the reaction proceeds. 262 Warshel has described this 
phenomenon as solvation substitution rather than 
desolvation. 167 This preorganization is paid for by the 
folding energy of the enzyme. 263 

IV. Kinetics of the Serine Protease Reaction 

The serine protease reaction is generally consid- 
ered to have three-step kinetic mechanism (shown 
in Figure 12): (a) formation of an enzyme— substrate 
(E*S) complex (K s = k-i/k\)\ (b) acylation of the active 
site serine (k 2 ); and (c) hydrolysis of the acylenzyme 

<j) *, o k 2 9 *s 9 

E -° H+ R^X = E A BVS ""ST E -° H+ R^OH 

X- 

k cat = kji^{k 2 + k 3 ) 
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If k 3 » k 2 , 
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Figure 12. The kinetic mechanism of serine proteases. 



intermediate (£3). Although the three-step mecha- 
nism of Figure 12 is widely accepted, it is important 
to note that some of its key features have not been 
rigorously tested. First, both k 2 and k 3 are composite 
rate constants that include a chemical transformation 
and a product dissociation step. It is generally 
assumed that the product dissociation step is fast 
relative to the chemical transformation. The evidence 
usually cited to support this assumption is the 
presence of a large solvent kinetic isotope effect (~3) 
on Acat, which is consistent with rate-limiting chemi- 
cal transformation. 184 However, solvent isotope effects 
can be difficult to interpret because the solubility of 
substrates and products, the viscosity of the solution 
and even the structure of the protein can change in 
D2O, so that the isotope effect may not be definitive 
proof of rate-limiting chemistry. 185 Second, the mech- 
anism indicates that dissociation of the leaving group 
must occur prior to hydrolysis of the acylenzyme. 
Again, there is little evidence on this point. Acyl- 
transfer experiments indicate that hydrolysis can 
occur from an acylenzyme— nucleophile complex, 
which appears to contradict this assumption. 97 186-188 
However, these acylenzyme— nucleophile complexes 
may be nonproductive complexes that do not form 
during the normal catalytic cycle. For example, 
during the hydrolysis of a peptide substrate, the 
leaving group would be constrained to interact at the 
ST site. In contrast, an added nucleophile could form 
a nonproductive complex at another S' site that would 
not impede hydrolysis of the acylenzyme. Thus, the 
order of hydrolysis and product release is not well 
established. 

Several other assumptions are frequently made 
when evaluating the kinetics of serine protease 
reactions. The Michaelis-Menten parameters Acat, 
iC m , and kcat/K m are composites of the rate constants 
as shown in Figure 12. In particular, the expression 
for K m includes A3 as well as Ai, A-i, and k 2 . Increasing 
values of k 2 will increase K m until k 2 > k 3 , but further 
increasing k 2 will decrease K m (Figure 12), so that 
K m can be exceedingly difficult to interpret with 
respect to substrate affinity. In addition, the expres- 
sions in Figure 12 are frequently simplified by 
assuming k-\ » k 2 . This assumption is probably valid 
for poor substrates, but the hydrolysis of good sub- 
strates is often diffusion controlled, which indicates 
k 2 > A-i. 189-191 

The assertion that acylation is rate-limiting for 
amide hydrolysis, i.e., A 3 » k 2 , is by far the most 
indiscriminately applied assumption. Seminal work 
from Bender and co-workers demonstrated that k 2 
is the rate-limiting step for the hydrolysis of Af-acetyl- 
L-tryptophanamide, while A 3 is rate limiting for the 
hydrolysis of N- acetyl-L- tryptophan esters. 59 These 
observations have been generalized to statements 
such as "acylation is rate determining for amide 
substrates and deacylation is rate determining for 
ester substrates of serine proteases", as may be found 
in many textbooks. This statement has gained wide 
acceptance because amides are intrinsically much 
less reactive than esters. However, Af-acetyl-L-trypto- 
phanamide is a very poor substrate for chymotrypsin, 
with k^/Km = 3.6 M -1 s -1 and Acat = 0.026 s -1 . 
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Enzymes often process poor substrates with different 
rate-determining steps than good substrates. Chymo- 
trypsin and trypsin hydrolyze peptides with values 
of AcatAKm in the range of ~10 7 M -1 s -1 and kc at ~ 100 
s -i 192,193 Michaelis-Menten parameters of this order 
of magnitude are typical for the reactions of serine 
proteases; thrombin and tissue plasminogen activator 
hydrolyze their natural substrates with kcJK^s « 
10 7 M _1 s -1 . 194 - 195 Even Factor D, a protease with 
notoriously low activity toward oligopeptide amide 
and ester substrates, cleaves its natural protein 
substrate with k^ t IK m > 2 x 10 6 M -1 s -1 . 196 - 197 Given 
that the values of k cat IK m for peptide hydrolysis can 
be 10 6 -fold greater than the amide substrates used 
in Bender's work, the acylation step should not be 
assumed to be generally rate-limiting. 

Indeed, several experiments suggest that k 2 is not 
always rate-limiting for amide hydrolysis. Compa- 
rable values of kc at have been reported for the 
hydrolysis of analogous oligopeptide amides and 
esters. 192,198-201 Such observations generally suggest 
that a common step is rate-limiting, i.e., hydrolysis 
of the acylenzyme intermediate. However, inferring 
individual rate constants from steady-state param- 
eters is a precarious practice. The most convincing 
case for rate-limiting deacylation in amide hydrolysis 
is the observation of bursts of product formation 
under pre-steady state conditions. Such bursts are 
observed in the hydrolysis p-nitroanilidamides by 
human leukocyte elastase and in the hydrolysis of 
methylcoumarinamides by Kex2 protease and 
furin. 202-204 Similar bursts have been observed in the 
hydrolysis of peptide methylcoumarinamides by 
trypsin (X. Liu and L. Hedstrom, unpublished experi- 
ments). Most convincingly, a burst is observed in the 
hydrolysis of a peptide substrate by Kex2. 205 Thus, 
the generalization "A 2 is rate determining for the 
hydrolysis of amide substrates by serine proteases" 
is not warranted. 

The above work is frequently criticized because the 
p-nitroanilides and methylcoumarinamides leaving 
groups are more reactive than a peptide bond. 
However, amide hydrolysis has little dependence on 
leaving group (J3\ g = 0.07) and the plot of the values 
of kou for the hydrolysis of both amides and anilides 
fall on a single line when plotted against pAa of the 
leaving group. 206 While p-nitroanilides are exception- 
ally reactive in solution, 207 this unusual reactivity is 
not recapitulated in protease reactions. The hydroly- 
sis of anilides, including p-nitroanilides, by chymo- 
trypsin displays no dependence on the pK a of the 
leaving group (most likely due to general base 
catalysis). Moreover, peptide bonds are wore reactive 
than anilides with leaving groups of comparable 
pAa. 110 Therefore, it is unlikely that the results cited 
above can be attributed to the unusual reactivity of 
the leaving groups. These observations further dem- 
onstrate that the rate-determining step of an amide 
substrate should not be automatically assigned to 
acylation. 

Several methods have been used to measure the 
values of the individual rate constants k\, k-\, k 2 , and 
k 3 . Surprisingly, although pre-steady state and single 
turnover experiments are the best methods for de- 
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termining individual rate constants, they have only 
rarely been applied to serine proteases, and usually 
confined to the analysis of ester hydrolysis (for 
examples, see refs 208-211). A recent study has 
applied these methods to the analysis of the hydroly- 
sis of peptide p-nitroanilides by elastase. 202 The 
individual rate constants are more commonly deter- 
mined by characterizing the hydrolysis of analogous 
amide/ester pairs. 59 199 ' 200 Deacylation is usually rate- 
limiting in the hydrolysis of esters, so that Acat.ester = 
k 3 . While this fact is often taken for granted, it can 
be substantiated if the reactions of a set of analogous 
esters with different leaving groups have the same 
kcat, this observation suggests that a common step is 
rate-limiting, i.e., deacylation. Likewise, an analo- 
gous amide substrate must form the same acyl- 
enzyme, so that k 3 must be the same as in the ester 
reaction. The values of K s and k 2 can then be 
determined from the equations in Figure 12 (with the 
assumption that k-\ » k 2 ). The values of K s , k 2 , and 
k 3 can also be determined using acyltransfer reac- 
tions under conditions where the added nucleophile 
does not form a complex with the acylenzyme. 94,212 
This method is especially useful in evaluating the 
hydrolysis of substrates where k 2 > k 3 . The analysis 
of the viscosity dependence of k c JK m can yield values 
of k\ and the ratio of A-iAfe. 189 This method must be 
applied with caution because it involves changing 
solvent, with attendent changes in dielectric constant 
and the possibility of other nonspecific effects, and 
therefore requires careful attention to controls. In- 
dividual rate constants have also been determined 
by analyzing the temperature dependence of the 
Michaelis-Menten parameters. 191 This method can- 
not identify the rate-limiting step and generally 
operates under the assumption that k 3 » k 2 . Using 
these various methods, values of k 2 of 10 2 — 10 4 s -1 
for good ester substrates and 10—1000 s -1 for good 
amide substrates have been reported. The value of 
k 3 is usually —50 s -1 for good substrates. Obviously, 
the values of k 2 and k 3 can be much lower if the 
substrate is not optimal. The values of k\ and k-\ are 
typically 10 6 -10 8 M -1 s -1 and 5-500 s"\ respectively. 

V. Substrate Discrimination during Catalysis 
A. The Hallmarks of Serine Protease Specificity 

Specificity is defined by how substrates compete 
for an enzyme, and is measured by the value of k cat / 
K m . The Pl/Sl interaction controls specificity, as 
illustrated with trypsin, where the value of kc at /K m 
varies over 10 5 -fold as the Pl/Sl interaction is 
optimized (Table 3). This increase results mainly 
from an increase in k^t (10 4 -fold) rather than K m (10- 
fold). Interactions at the S2— Sn sites also strongly 
influence specificity, as illustrated with elastase 
where the value of k ca t/K m increases 10 5 -fold as the 
additional P residues are added (Table 3). Again, the 
increase results mainly from an increase in k ca t (10 3 - 
fold) rather than K m (100-fold). Similarly, P7S' in- 
teractions increase k^lK^ once again via an increase 
in Acat (Table 3). The effect of the P7S' interactions 
can be most dramatically illustrated by the inactiva- 
tion of chymotrypsin by carbonate esters, where 
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Table 3. The Hydrolysis of Substrates by Rat Trypsin and Porcine Pancreatic Elastase 



substrate 


WAmfM^S- 1 ) 


Acat (S" 1 ) 


Kn («M) 




Rat Trypsin 6 






Suc-Ala-Ala-Pro-Arg-AMC 


1.3 x 10 6 


91 


70 


Suc-Ala-Ala-Pro-Lys-AMC 


1.2 x 10 6 


120 


100 


Suc-Ala-Ala-Pro-Tyr-AMC 


3 


6.3 x 10" 4 


200 


Suc-Ala-Ala-Pro-Phe-AMC 


8 


6.3 x 10" 3 


780 




Porcine Pancreatic Elastase 






Ac-Ala-NH 2 C 


<0.05 


< 0.008 


160 {Ki) 


Ac-Pro-Ala-NH 2 c 


0.07 


0.007 


100 


Ac-Ala-Pro-Ala-NlV 


25 


0.1 


4.2 


Ac-Pro-Ala-Pro-Ala-NH^ 


3700 


9.3 


2.5 


Ac-Ala-Pro-Ala-Ala-NH^ 


520 


2.4 


4.6 


Ac-Pro-Ala-Pro-Ala-Ala-NH/ 


25,500 


37 


1.4 




Human Leukocyte Elastase e 






Cbz-Val-pNP 


480 






Cbz-Phe-pNP 


65 






Boc- Ala- Ala- Val- SBzl 


3.0 x 10 6 






Boc-Ala-Ala-Phe-SBzl 


<10 







a "— " denotes cleaved bond; AMC, aminomethylcoumaride; pNP, p-nitrophenyl; SBzl, thiobenzyl. 6 Data from ref 218. c Data 
from ref 260. d Data from ref 261. e Data from ref 215. 



Table 4. Inhibition of Chymotrypsin by Carbonate 
Esters 3 



Table 5. The Specificity of Binding, Acylation, and 
Deacylation 3 



E-OH + 



E-O' 



+ XH 



leaving group X 


Aon (M 1 S 1 ) 


OCH 3 


9 


0-Ala-OCH 2 CH 3 


200 


0-Ala-Leu-OCH 3 


2,400 


0-Ala-Leu-Arg-OCH 3 


>23,000 


a Data from ref 213. 





interactions with ST- S3' sites increase the rate of 
inactivation > 2000-fold (Table 4). 213 

Importantly, the SI site and the remote binding 
sites act in concert to determine serine protease 
specificity. 200 - 214 - 215 While the value of hJK^ in- 
creases dramatically with peptide length, this in- 
crease requires the correct PI— SI interaction. 52216 
This phenomenon can be illustrated with the hy- 
drolysis of />nitrophenylesters by human leukocyte 
elastase (HLE) (Table 3). 215 HLE displays little 
specificity in the hydrolysis of single residue esters, 
with the PI— Val substrate preferred by only 7-fold 
over the PI— Phe, even though the SI site of HLE 
should not accommodate large residues. The addition 
of two peptide residues increases the hydrolysis of 
the PI— Val substrate by >10 3 , but decreases the 
hydrolysis of the Phe-containing substrate. Likewise, 
increasing peptide length dramatically increases the 
value of kcat/Km in the trypsin-catalyzed hydrolysis 
of substrates containing P 1 - Arg or Lys, but not P 1 - 
Phe. 200 Similar results are observed for other serine 
proteases. 

B. Specificity Is Determined by the Binding and 
Acylation Steps 

As noted above, kca t /K m depends only on the binding 
and acylation steps k\, k-\ 9 and k 2 (Figure 12). From 
an intuitive perspective, if two substrates compete 
for protease, the competition is "won" when acylation 



substrate 



kcatlKm 

(M^s 1 ) 



As 
(«M) 



(s- 1 ) 



£3 

(s- 1 ) 



75 


788 


0.06 


10 


575 


722 


0.42 


11 


56,000 


810 


43 


13 


psin II 








4000 


96 


0.7 


65 


1 x 10 7 


>50 


>1300 


56 


1500 


220 


0.32 


130 


30,000 


6500 


280 


18 


0.2 


1000 


0.0002 


30 


6 


1000 


0.007 


60 



Human Leukocyte Elastase 6 
MeOSuc-Val-pNA 
MeOSuc-Pro-Val-pNA 
MeOSuc-Ala-Pro-Val-pNA 



Z-Arg-AMC C 
Tos-Gly-Pro-Arg-AMC c 
Z-Lys-AMC C 
D-Val-Leu-Lys-AMC C 
Ac-Phe-NH 2 rf 

Suc-Ala-Ala-Pro-Phe-AMC^ 

a Mechanistic rate constants as defined in Figure 12. "— " 
denotes cleaved bond; pNA, p-nitroanilide; AMC, aminomethyl- 
coumaride. b Data from ref 52. c Data from ref 243. d Data from 
ref 200. 

occurs and deacylation is irrelevant. Since the in- 
crease in the values of k cat /K m is derived from an 
increase in k^ rather than a decrease in K m , specific- 
ity must be primarily determined by acylation. This 
conclusion is borne out when the steady-state kinetic 
parameters are deconvoluted into K s and k 2 (Table 
5). The value of k 2 varies by more than 10 3 x between 
good and poor substrates while only 10— 100-fold of 
substrate discrimination derives from substrate af- 
finity. 52 ' 200 ' 202 - 216 It is especially noteworthy that long 
peptide substrates bind with approximately the same 
affinity as short substrates. 

Although £3 is not a component of k cat /K m , it should 
not be completely ignored. If the value of k 3 is very 
small, the acylenzyme will be very stable, substrate 
turnover will be very slow and the enzyme will be 
essentially inactivated. The deacylation step is there- 
fore a critical determinant of substrate turnover. The 
deacylation step also varies among substrates, al- 
though much more dramatic changes in the substrate 
are required to affect £3. Interestingly, while the 
values of k 2 for PI— Arg containing substrates are 
100-10 5 -fold great than PI -Phe, the values of K s are 
< 10-fold less and the values of k 3 are similar (Table 
5). Thus, the preferences of the binding, acylation and 
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deacylation steps are different even though these 
steps occur on the same enzyme surface. 

C. The Contribution of Substrate Association to 
Specificity 

Inspection of the algebraic expression for k ca t/K m 
reveals another important mechanistic fact: as the 
value of ki increases, k-\ will become insignificant, 
kat/K m = k\ and the reaction will be diffusion con- 
trolled. If the value of k\ was determined by the 
simple collision of substrate and protease, then k\ 
would be similar for all substrates, i.e., ~10 8 M" 1 s" 1 . 
However, values of k\ of 10 6 — 10 7 M" 1 s" 1 have been 
reported, which suggest that substrate association is 
more complicated than a simple collision. Obviously, 
the substrate/protease interface has many hydrogen 
bonding and van der Waals interactions which seem 
unlikely to form in a single step. Some evidence for 
isomerization steps has been reported. 52 - 56 One might 
expect that the value of k\ should decrease with 
peptide length since more interactions must form. 
The value of k-i should also decrease, since more 
interactions would have to be disrupted when the 
substrate dissociates. If the values of k\ and k-\ 
decrease in parallel with peptide length, then the 
overall affinity will remain constant. This idea can 
be illustrated with the inhibition of chymotrypsin by 
trifluoroketones. Ac-Leu-Phe-TFK is a more potent 
inhibitor of chymotrypsin than Ac-Phe-TFK (Ki = 0.5 
and 4.5 nM, respectively; values are corrected for free 
ketone concentration). 49 Ac-Leu-Phe-TFK has a lower 
value of k x than Ac-Phe-TFK (4 x 10 6 and 3 x 10 7 
M" 1 s" 1 , respectively), and a lower value of k-\ (0.002 
and 0.13 s -1 , respectively). The structure of the 
chymotrypsin does not change appreciably in the two 
inhibitor complexes, suggesting the kinetics of Ac- 
Leu-Phe-TFK inhibition are slower because more 
enzyme-inhibitor contacts must form. 74 

When a reaction is diffusion controlled, the con- 
formation and solvation of the substrate and protease 
will determine the rate of substrate association, and 
hence of specificity. These considerations illustrate 
why the structures of both free enzyme and free 
substrate must be known to assess the basis of 
specificity. Unfortunately, the structure of the sub- 
strate in solution is rarely considered in any enzy- 
matic reaction. Several puzzling observations about 
serine protease specificity might be explained by the 
solution conformation of peptide substrates. For 
example, the P3 residue appears to be a major 
determinant of specificity even when it makes few 
contacts with the enzyme. Large P3 residues may 
favor a more extended peptide conformation that 
would favor association. The surprising observation 
that the substitution of ester bonds at the P4— P3 and 
P3-P2 amide bonds has little effect on the value of 
hJK m in trypsin reactions 46 might also be explained 
by changes in solution conformation and/or solvation. 

D. Implications for the Specificity of Ester 
Hydrolysis 

The specificity of ester hydrolysis may be domi- 
nated by substrate association. Once bound in the 



active site cleft, the intrinsic reactivity of an ester 
ensures acylation and progression through the cata- 
lytic cycle. This condition almost certainly holds for 
p-nitrophenylesters, where acylation is so facile that 
the reaction is concerted. Thus, the specificity of ester 
hydrolysis will underestimate the potential of a 
protease to discriminate between peptide substrates. 
The validity of this idea is illustrated in the hydroly- 
sis of "inverse substrates". Trypsin hydrolyzes p- 
amidinophenyl acetate, where the leaving group 
binds in the SI site, almost as well as normal ester 
substrates (kc at /K m = 10 5 versus 10 7 M" 1 s" 1 ) (Figure 
13). 217 Similarly, the facile reaction of p-nitrophenyl 
guanidinobenzoate with chymotrypsin can also be 
explained by an inverse orientation, where the p- 
nitrophenyl leaving group binds in the S 1 site and 
the guanidino group interacts with ST. 

h 2 n^TA 0 X CH H2N >nfyoH * ch 3 co 2 h 

Figure 13. Inverse substrates of trypsin. 

When the reaction is diffusion controlled, the rate- 
limiting transition state will not involve formation 
of the tetrahedral intermediate as generally sup- 
posed; instead the rate-limiting step will be formation 
of the E-S complex. The general features of this 
transition state might include the canonical substrate 
conformation as well as interactions at the various 
subsites. Many investigators have assumed that 
protein inhibitors, which are locked in the canonical 
conformation, form complexes that mimic ground- 
state E-S complexes. However, the values of K { for 
protein inhibitors correlate with values of kc at /K m for 
the analogous substrates, a feature of transition state 
analogues but not ground-state inhibitors. 38 The 
canonical conformation may be a high energy con- 
formation that activates the scissile bond and thus 
may be the transition state-like feature of protein 
inhibitors. 

VI. Redesigning the 

Trypsin-Chymotrypsin- Elastase Paradigm 

Textbooks often illustrate enzyme specificity with 
the pancreatic serine proteases trypsin, chymo- 
trypsin, and elastase: trypsin, with Asp 189, cleaves 
after PI Arg/Lys residues; chymotrypsin, with Serl89, 
cleaves after PI Phe/Tyr/Trp; and elastase, with 
Val216 and Thr226, cleaves after PI Ala/Val. The 
trypsin- chymotrypsin- elastase paradigm suggests 
that protease specificity is determined by a few 
structural elements. However, the obvious mutations 
fail to transfer the specificity of one protease into 
another. 

A. Redesigning Amidase Activity 

Switching these elements from one framework to 
the next should be sufficient to change specificity. 
However, the substitution of Asp 189 with Ser does 
not convert trypsin into protease with specificity for 
Pl-Phe (Table 6). 200 - 218 ' 219 Trypsin does acquire chy- 
motrypsin-like specificity when the residues of the 
SI site and two surface loops are replaced with their 
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Table 6. Mechanistic Kinetic Parameters for Amide 
Hydrolysis by Chymotrypsin, Trypsin, and Trypsin 
Mutants* 

itcat/Am K s k 2 k 3 

substrate (M^s" 1 ) (M) (s 1 ) (s" 1 ) 



Chymotrypsin 



Ac-Phe-NH 2 


18 


0.023 


0.43 


60 


Suc-AlaAlaProPhe-AMC 


4.9 x 10 5 


2.5 x 10" 4 


12 


52 


Suc-AlaAlaProPhe-pNA 


5.6 x 10 5 


1.5 x 10 3 


850 


52 




Trypsin 








Ac-Phe-NH 2 


0.2 


1 x 10" 3 


2 x 10" 4 


30 


Suc-AlaAlaProPhe-AMC 


6.0 


1.1 x 10" 3 


0.007 


36 


Suc-AlaAlaProPhe-pNA 


9 


> 0.25 


> 0.2 


36 




D189S 








Ac-Phe-NH 2 


0.1 


0.17 


0.018 


39 


Suc-AlaAlaProPhe-AMC 


38 


2.8 x 10" 3 


0.10 


33 


Suc-AlaAlaProPhe-pNA 


21 


0.15 


0.29 


33 


Tr-Ch[Sl+Ll+L2] 






Ac-Phe-NH 2 


0.3 


0.065 


0.018 


33 


Suc-AlaAlaProPhe-AMC 


800 


>0.006 


>4 


37 


Suc-AlaAlaProPhe-pNA 


1.7 x 10 3 


0.011 


20 


37 


Tr — Ch[Sl+Ll+L2+Y172W] 






Ac-Phe-NH 2 


0.7 


>0.1 


>0.08 


38 


Suc-AlaAlaProPhe-AMC 


9.3 x 10 3 


> 0.006 


>60 


63 


Suc-AlaAlaProPhe-pNA 


8.3 x 10 4 


5 x 10" 4 


41 


63 



a Data from ref 220. 
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Figure 14. The transfer of P 1 — Phe specificity into trypsin. 
The values of As, kz, and A3 are plotted versus kcatK m 
for the hydrolysis of Suc-Ala-Ala-Pro-Phe-AMC by 
trypsin, Aspl89Ser trypsin, Tr — Ch[Sl+Ll+L2], Tr — 
Ch[Sl+Ll+L2+Y172W] and chymotrypsin. The acylation 
rate A 2 increases as kcJK m increases, but the values of K s 
and A3 remain constant. 

counterparts from chymotrypsin. These substitutions 
are Aspl89Ser, Glnl92Met, insertion of Thr219, 
Ilel38Thr, the 185 loop (residues 185-188) and the 
220 loop (residues 221-224). This mutant protease, 
Tr — * Ch[Sl+Ll+L2], has lOOx more activity than 
D189S, and has 0.5% of the activity of chymotrypsin 
(Table 6). An additional substitution, Tyrl72Trp, 
increases activity to within 15% of chymotrypsin. 220 
The installation of chymotrypsin-like activity into the 
trypsin framework is almost entirely the result of 
acceleration of the acylation step, which increases by 
~10 4 -fold (Figure 14). Deacylation is comparable in 
all of the enzymes. However, none of the mutant 
proteases bind substrates well. 

Although these mutations focused on the S 1 site, 
the increase in acylation originates in the interac- 
tions of the more remote sites. For chymotrypsin, the 
value of k 2 is —400-fold larger for a long substrate 
than a short substrate. In contrast, for D189S 



trypsin, the values of kz for the hydrolysis of 
short and long substrates are similar (Table 6). 
Thus, unlike chymotrypsin, D189S trypsin cannot 
utilize binding in the remote sites to increase acyl- 
ation. However, in Tr -* Ch[Sl+Ll+L2] and Tr — * 
Ch[Sl+Ll+L2+Y172W], acylation increases by > 10 3 - 
fold between short and long substrates. Thus, the 185 
and 220 loops mediate the translation of remote 
binding interactions into catalysis. 

Why is Aspl89Ser trypsin a poor nonspecific pro- 
tease? First, the substitution of Asp 189 with Ser 
deforms the SI site and the activation domain (the 
region that rearranges during zymogen activa- 
tion). 219,221 This is the all too typical protein engineer- 
ing result: mutation leads to unanticipated defects 
in structure. Interestingly, BPTI induces D189S to 
assume a trypsin-like conformation, masking these 
structural defects 222 and underscoring the need to 
evaluate both free and bound enzymes and substrates 
when assessing the structural basis of specificity. The 
substitution of Asp 189 with His deforms the SI site 
even in the presence of BPTI, further illustrating the 
inherent fragility of the SI site. 223 These observations 
suggest that mutation of Asp 189 might be sufficient 
to change specificity if only the SI site was more 
structurally sound. 

How do the 185 and 220 loops and Tyrl72Trp 
substitution influence catalysis? The Tyrl72Trp mu- 
tation helps rebuild the SI site in the presence of 
Serl89. The SI site of Tr — Ch[Sl+Ll+L2+Y172W], 
unlike D189S trypsin or Tr — Ch[Sl+Ll+L2], is 
sufficiently structured to allow proflavin binding. The 
structures of the chloromethyl ketone-inactivated 
complexes of these proteases also suggest that the 
Tyrl72Trp mutation stabilizes the SI site. 16 Overall, 
the structures of both Tr — Ch[Sl+Ll+L2] and Tr 
— Ch[Sl+Ll+L2+Y172W] resemble chymotrypsin, 
but the 185 and 220 loops are disordered in both 
mutants. Tr — Ch[Sl+Ll+L2+Y172W] is more 
structured than Tr — - Ch[Sl+Ll+L2], in keeping 
with its higher activity. The substitution of the 185 
and 220 loops are probably required to construct an 
SI site containing Ser 189. 

The fragility of the S 1 site is not the whole story. 
The substitution of the 185 and 220 loops have a 
subtle effect on the polypeptide binding site: the 
conformation of Gly2 1 6 changes so that the carbonyl 
points toward the P3 NH in a better orientation for 
the hydrogen bond. The function of the polypeptide 
binding site may be mediated by the conformation 
of Gly216, and therefore by the 185 and 220 loops. 15 
The loops may also modulate the dynamic properties 
of the active site cleft. Gly216 and the analogous 220 
loop also appear to be important determinants of 
alpha lytic protease specificity, where strong evidence 
exists for the involvement of dynamic processes (see 
below). 224 - 225 

The SI site, 185 and 220 loops, and residue 172 do 
not define a universal set of structural determinants 
of specificity in serine proteases. This set of substitu- 
tions fails to install trypsin-like specificity into chy- 
motrypsin, 226 nor do they install an elastase-like 
specificity into trypsin. 227 These results suggest that 
a different structural solution is required for each 
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specificity. In addition, alternate solutions to the 
trypsin-to-chymotrypsin conversion may exist. Sta- 
bilization of the activation domain would also stabi- 
lize the S 1 site. Bacterial serine proteases do not use 
the zymogen activation mechanism of mammalian 
serine proteases, which suggests that other strategies 
exist for stabilizing the SI site. This strategy is 
probably the mechanism operating a metal-activated 
Aspl89Ser trypsin variant. The substitution of Asnl43 
and Glu 151 with His creates a metal binding site in 
trypsin. 228 This metal site can also interact with a 
P2' His residue, thus installing specificity for His at 
the S2' site. These mutations also increase the 
activity of Aspl89Ser trypsin by 300-fold. Asnl43 and 
Glul51 are also part of the activation domain; metal 
binding will stabilize the activation domain, and by 
extension, the SI site, which accounts for the increase 
in activity. 

Unlike the SI site, the SI' and S2' sites can be 
redesigned easily. A single substitution, Lys60Glu, 
is sufficient to create a trypsin variant with a ~ 100- 
fold preference for Arg at ST. 229 The construction of 
a metal binding site at S 2' as described above was 
sufficient to create specificity for P2 His. 223 Why are 
the S 1 sites of trypsin and chymotrypsin so difficult 
to redesign, especially when compared with the ST 
site? Part of the answer is undoubtedly in topology 
of the sites. The SI site is a deep pocket which 
surrounds the PI residue. A cavity is an intrinsically 
unstable structure and less able to tolerate substitu- 
tions. In contrast, the ST site is a shallow groove. 
Moreover, the SI site of trypsin is part of the 
activation domain of trypsin. The SI site has two 
conformations, the ordered structure of the mature 
protease and an inactive, zymogen-like conformation. 
It is not surprising that the Aspl89Ser mutation 
causes complete deformation of the S 1 site, pushing 
the conformational equilibrium to the zymogen-like 
form. 221 - 230 In effect, the SI site is "built to collapse". 
In contrast, the structure of the ST site remains 
constant during zymogen activation, and is inher- 
ently more structurally sound than the SI site. The 
stability of the framework is obviously an important 
determinant of the ability to redesign an enzyme. 

A well-ordered SI site may not be an absolute 
requirement for high protease activity, so long as the 
conversion to a catalytically competent structure 
does not invoke a huge energetic penalty. Tr — 
Ch[Sl+Ll+L2] is almost as active as chymotrypsin 
although its SI site is deformed. However, the rigid 
SI site of chymotrypsin is almost certainly a require- 
ment for survival in its physiological niche— the 
protease-rich duodenum. It is worth remembering 
that enzymes evolve to optimize all aspects of their 
physiological functions, including catalysis, expres- 
sion, translation, folding, and degradation. 

B. Redesigning Esterase Activity 

In contrast to amide hydrolysis, the esterase speci- 
ficity of the SI site can be easily re-engineered. The 
Aspl89Ser mutation is sufficient to change the 
specificity of trypsin to that of chymotrypsin in ester 
hydrolysis. 200 Trypsin has been converted into an 
elastase-like esterase, but this enzyme has no meas- 
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urable amidase activity. 227 Similarly, the preference 
of granzyme B for acidic P 1 residues can be changed 
to Phe with the substitution of Arg226 with Gly 
(unfortunately, this work did not utilize purified 
enzymes, and therefore must be included with cau- 
tion). 231 The ease with which esterase activity can be 
manipulated derives from the much greater intrinsic 
reactivity of esters as discussed above. 

Mil How Do Remote Interactions Translate into 
Catalysis? 

The operation of the catalytic machinery in the 
immediate vicinity of the scissile bond appears very 
straightforward: Aspl02, His57, and Serl95 and the 
oxyanion hole are aligned to facilitate proton trans- 
fers and stabilize the charges that develop as the 
reaction proceeds. It is also clear that a network of 
hydrogen bonds propagates outward from the cata- 
lytic triad, so that changes in bonding order and 
charge distribution at the scissile bond will propagate 
to more remote enzyme-substrate interactions, and 
vice versa. Interactions far from the scissile bond, in 
the SI site, polypeptide site, S2— Sn or ST— Sn', can 
dramatically and cooperatively accelerate the hy- 
drolysis of substrate. Such interactions must optimize 
the function of the catalytic machinery. 

The influence of remote binding interactions on the 
catalytic triad is illustrated by trifluoroketone inhibi- 
tion. The His57 Ndl protons have low-field chemical 
shifts, elevated piC a 's, low fractionation factors and 
slow rates of D2O exchange. 149 150 165 For a series of 
inhibitors varying in the P2 residue, these properties 
correlate with the affinity of the inhibitor as well as 
hJK m for the analogous substrate. 150 151 163 165 - 232 Of 
course, the big question is how do these remote 
interactions translate into catalysis. Most remote 
interactions do not increase substrate affinity but 
contribute exclusively to catalysis. This observation 
suggests that these interactions introduce strain into 
the substrate, which penalizes binding affinity but 
accelerates catalysis. This strain can take many 
forms, including desolvation of enzyme and substrate, 
locking the substrate in the canonical conformation, 
interactions of the carbonyl with the oxyanion hole, 
damping of protein motions, conformational changes 
and/or geometric distortion. What follows are some 
possible mechanisms, in no particular order. These 
mechanisms are not mutually exclusive, and it is 
likely that several are operating. 

A. Remote Interactions Align the Substrate in the 
Active Site 

One proposal is that remote binding interactions 
align the substrate relative to the catalytic triad, 
placing the substrate in catalytic register. 15 This 
mechanism is particularly attractive when consider- 
ing the protease subsites immediately adjacent to the 
scissile bond, i.e., S2, SI, and ST. These interactions 
dock the scissile bond next to Serl95 and the oxy- 
anion hole. Unfortunately, the alignment mechanism 
becomes less attractive as the interactions get farther 
from the site of chemical transformation. Once the 
Pl/Sl and P2/S2 interactions are formed, the sub- 
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strate is tethered in position; there would seem to 
be little gained for interactions beyond the S2 site. 
Another mechanism would appear necessary to har- 
ness the binding energy at these more remote sites. 

Any explanation for how distal interactions ac- 
celerate protease catalysis must also explain the how 
these interactions can act "in trans". For example, 
amines or guanidinium compounds accelerate the 
hydrolysis of substrates containing PI -Gly or Ala by 
trypsin. 233 - 234 This observation suggests that the PI — 
SI interaction does not simply dock the substrate in 
the active site cleft. Ethylamine could trigger a subtle 
conformational change that activates the enzyme. 
The PI /SI interaction may prevent nonproductive 
binding modes, thus indirectly improving the align- 
ment of the substrate in the active site. However, the 
alignment mechanism does not appear to be suf- 
ficient to explain the effects of the remote binding 
interactions on catalysis. 

B. Remote Interactions Are Optimized in the 
Transition State 

Enzymes are often described as templates for the 
transition state of a reaction. Conventional views of 
the transition state of protease reactions focus on the 
site of bond making/breaking and formation of the 
tetrahedral adduct. However, if remote interactions 
increase catalysis, then remote interactions must also 
be part of the transition state. Therefore, a complete 
description of the transition state must include the 
P/S and P7S' interactions in addition to the tetra- 
hedral adduct, catalytic triad and oxyanion hole. The 
most dramatic changes in structure are involved in 
the bond making and breaking at the scissile bond, 
while smaller changes result from the changes in 
geometry in the transformation from planar sub- 
strate to tetrahedral transitions state. These small 
changes can propagate into significant displacements 
down the peptide chain, optimizing the remote bind- 
ing site interactions in the transition state. 235 Such 
changes could provide a mechanism for the rate 
acceleration. Support for this mechanism can be 
found in the inhibition of elastase by peptidyl alde- 
hydes. 47 While the P4 residue increases the affinity 
of an alcohol inhibitor by 10-fold, the affinity of the 
corresponding aldehyde inhibitor increases by 100- 
fold (Table 7). This observation demonstrates that the 
P4— S4 interaction is optimized in a tetrahedral 
adduct. Similarly, the hydrogen bond between the 
PI— NH and the Ser214 carbonyl appears to be 
stronger in transition state analogue complexes than 
in protein inhibitor complexes (Table 8). 45,236 These 
observations suggest that the interactions at the 
remote sites are optimized in the transition state. 



Table 7. Inhibition of Porcine Pancreatic Elastase by 
Peptidyl Aldehydes 3 



inhibitor 




Ac-Ala-Pro-Ala-CH 2 OH 


7000 


Ac-Pro-Ala-Pro-Ala-CH 2 OH 


600 


Ac-Ala-Pro-Ala-CHO 


62 


Ac-Pro-Ala-Pro-Ala-CHO 


0.8 


a Data from ref 47. 





Table 8. Optimization of Hydrogen Bonds in 
Transition State Complexes 3 



hydrogen bond 


boroVal 


Pval-Lac 
(Pval) 


ovomucoid 


oeriyo i\ ri ui 


9 Q 


9 Q 


Q 1 


Vjiyiyo in ri ui 




2 6 


9 


His57 N^2-P1 02 


2.7 


5.5 (2.9) 6 




Ser214 0-P1N 


3.0 


3.0 


3.6 


Gly216N-Ala P3 O 


3.0 


3.0 


2.9 


Gly216 0-Ala P3N 


2.9 


2.9 


3.0 


His57 Ne2-Serl95 Oy 


3.0 


7 A (3.0) 


2.6 


Leu41 0-P2 , N 




2.8 


2.9 


Leu41N-P2 , 0 




3.0 


3.1 



a X-ray crystal structures of alpha lytic protease complexes 
with Boc-Ala-Pro-boro-Val, 37 Boc-Ala-Ala-Pro-Pval- 
Lac-Ala, where Pval is the phosphonic acid analogue of Val 
and Lac is lactate 45 and Boc-Ala-Ala-Pro-Pval (in paren- 
theses) and ovomucoid third domain 236 are compared. Data 
from ref 45. b In the Boc-Ala-Ala-Pro-Pval-Lac-Ala, His57 
has rotated away from the inhibitor, presumably due to the 
absence of a negative charge in the neutral phosphonate. The 
Nel hydrogen bond is present in the charged phosphonate. 



C. Remote Interactions Induce a Conformational 
Change that Favors Catalysis 

One oft-invoked mechanism for utilizing remote 
binding interactions is induced fit, i.e., remote inter- 
actions induce a conformational change that aligns 
the catalytic machinery, activating the enzyme. 
However, no conformational change is observed when 
chymotrypsin or trypsin bind substrates or inhibitors, 
and the alignment of the catalytic triad does not 
change. Thus, a classical induced fit mechanism is 
not operative and serine proteases approach the ideal 
of the enzyme as a template for the transition state 
(examples of serine proteases that do utilize an 
induced fit mechanism will be discussed below). 
However, subtle conformational changes are observed 
when apoenzyme and inhibitor complexes are care- 
fully compared. For example, when trypsin binds a 
peptidyl boronic acid, small adjustments are observed 
in —50 of the 224 residues, 237 and similar adjust- 
ments are also observed in boronic acid structures 
of alpha lytic protease. 37 These structural changes are 
small and seem unlikely to involve a significant 
energetic barrier. Rather, this structural plasticity 
is probably required to accommodate product as- 
sociation/dissociation and the different transition 
states of the protease reaction. 

If good substrates induce conformational changes 
that increase the reactivity of the catalytic triad and 
oxyanion hole, then disruptions of the catalytic 
machinery should have a more deleterious effect on 
the hydrolysis of good substrates. Unfortunately, 
there are little data on this point because these 
mutations decrease activity to the point where only 
good substrates can be monitored. The substitution 
of Asp 102 with Asn appears to have a more deleteri- 
ous effect on the hydrolysis of peptide substrates than 
other substrates, but this difference is not observed 
when the pH-independent reaction is considered. 79 
Mutations of His57 do change specificity, but this is 
expected because His57 forms part of both the S2 and 
ST sites (Figures 7). Unfortunately, it will be difficult 
to correlate such subtle structural differences to 
catalysis. 
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D. The Remote Interactions Shield the Catalytic 
Triad from Solvent 

Many enzymes, even proteases, have motile flaps 
that close over the active sites, preventing solvent 
access to the active site. Such desolvation provides a 
simple mechanism to achieve large rate accelera- 
tions. 262 - 264 The remote portions of the peptide sub- 
strate could perform this task for serine proteases. 
In particular, the P2 and PT residues will shield 
His57 from bulk solvent. This effect can be illustrated 
by the exchange of the protons of His57 with solvent 
in trifluoroketone complexes. The rate of exchange 
decreases markedly with better P2 residues (from 280 
to 12s" 1 ; these rates are lOOx greater than the rate 
of inhibitor dissociation, indicating that proton ex- 
change depends on the diffusion of water into the 
complex). 150 Thus, the P2 residue blocks water access 
to the catalytic triad. Interactions at more remote 
sites could similarly restrict solvent access as well 
as lower the dielectric constant of the active site. 
Similarly, the "trans" phenomenon could be explained 
by the expulsion of water from the active site, or 
perhaps by neutralizing the negative charge of Asp 189, 
favoring catalysis by lowering the dielectric constant 
in the vicinity of the scissile bond. Note that the tight 
packing of enzyme and substrate can cause steric 
compression that will also facilitate proton trans- 

f er< 23,163,238 



E. Remote Interactions Couple Catalysis to 
Motion of the Protease Structure 



VIII. Specificity via Induced Fit and Allostery 

The above discussion focused on specificity that 
derived from a relatively rigid active site cleft as 
found in chymotrypsin, trypsin, and elastase. Serine 
protease specificity has also been ascribed to classical 
induced fit mechanisms and allosteric interactions. 
The following will provide examples of these phe- 
nomena with complement protease Factor D, the 
bacterial enzyme a-lytic protease and the coagulation 
protease thrombin. 

A. Induced Fit as a Mechanism for Narrow 
Specificity 

It is important to recognize that an effective 
induced fit mechanism requires more than the simply 
placing a conformational change in the reaction 
coordinate (Figure 15). 240,241 An enzyme that can 
rapidly equilibrate between inactive and active con- 
formations will be no more specific than an enzyme 
that does not undergo a conformational change— the 
equilibrium between inactive and active forms simply 
lowers the fraction of active enzyme. This must be 
the case if the chemical transformation is rate- 
limiting for the reaction; the substrate binding steps 
will be in equilibrium, and no advantage is gained 
from the conformational change. 

1. Rigid enzyme 

Si 



_ _ slow _ _ 

E^St E + P-, 



Protein dynamics is the wild card in protease 
catalysis and specificity, as is true of enzyme catalysis 
as a whole. The idea that reaction dynamics are 
coupled to protein motions has long intrigued bio- 
chemists. Such motions might be used to mechani- 
cally stress the peptide bond, disrupting resonance 
stabilization and activating the carbonyl for nucleo- 
philic attack. 239 Likewise, the attack of Serl95 on the 
scissile bond might be linked to movement of the 
surrounding residues. Less dramatically, dynamic 
movement could simply be important for substrate 
association and product dissociation. The location of 
the active site between the two beta barrels places 
the substrate at junction of domain motions in 
chymotrypsin-like proteases. 

The serine proteases provide a unique opportunity 
to address this question: if such protein motion is 
required for catalysis, then similar motions should 
be observed in chymotrypsin-like and subtilisin-like 
proteases, as well as the other serine protease 
families. On the other hand, it is difficult to see how 
protein dynamics could be maintained in four differ- 
ent structures. It may be possible to monitor protein 
motions during serine protease catalysis. Such ex- 
periments best utilize reversible reactions, which has 
thwarted their application to protease reactions. 
However, proteases also catalyze the exchange of 
water into the C-terminus of a product carboxylic 
acid. This reaction could be exploited to investigate 
protease motions during catalysis. 



specificity = 



2. Enzyme with a conformational change, K c * « 1 
E* + Si ^ E*-S, 

K,-| 

E + Si^zr E*Si -^U E + P t 

slow 1 

E* + S 2 E*«S 2 
E + S 2 ^ E«S 2 E + p 2 



specificity ■ 



(k cat /KJ S] x(l/K c *) (k cat /KJ s 



xO'V) (KJKJ s: 



3. Induced fit enzyme, two active conformations 

E* + S 1 J=r E*.Si E* + P-| 



E + S 2 ~ 



E-S 2 



E + P 2 



Figure 15. The effect of a conformational change on 
specificity. Simply adding a conformational change to an 
enzyme does not provide specificity relative to an enzyme 
that does not undergo a conformational change. As shown 
in Schemes 1 and 2, the conformational change simply 
decreases the amount of active enzyme form. However, 
with the added constraint that the conformational change 
is rate-limiting, and the good substrate accelerates the 
conformational change, specificity can be achieved because 
the binding steps are no longer at equilibrium. If both 
conformations are active as in Scheme 3, broad specificity 
can be attained. 
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This principle is illustrated in the trypsin/tryp- 
sinogen system, where mutations that alter the 
equilibrium between active and inactive conforma- 
tions do not change specificity. 242 The Ilel6-Aspl94 
salt bridge can be destabilized by the substitution of 
He 16 with smaller residues, favoring the inactive 
zymogen-like conformation. Such mutations decrease 
the hydrolysis of good and poor substrates equiva- 
lently, including both amides and esters. 243 In addi- 
tion, mutations can be introduced that activate 
trypsinogen by stabilizing the mature protease con- 
formation. 244 The equilibrium between inactive and 
active conformations can be determined by measur- 
ing the affinity of BPTI, which induces the mature 
protease conformation. 242 The decrease in values of 
kcJK m correlates with the decrease in affinity of 
BPTI, as expected for the mechanism of Figure 15. 

An induced fit mechanism can impart specificity 
when the conformational change is rate-limiting. In 
this case, a good substrate can accelerate the rate of 
the conformational change, and thus accelerate the 
reaction. The binding steps will not be at equilibrium 
because the substrate will rapidly react to products. 
Further discrimination can be achieved if chemistry 
is slow for the poor substrate. 241 

B. Factor D: An Induced Fit Protease 

Such an induced fit mechanism can account for the 
remarkable specificity of Factor D. Unlike most 
serine proteases, Factor D circulates in plasma in its 
mature form, and therefore must have extraordinary 
specificity. Factor D has only one natural substrate, 
the complement complex C3bB. 245 Remarkably, k cat / 
K m for hydrolysis of C3bB is 10 6 M" 1 s" 1 , comparable 
to the hydrolysis of a good peptide substrate by 
trypsin. However, Factor D has little esterase activity 
(k cat /K m = -300 M" 1 s" 1 compared to 10 7 M" 1 s" 1 for 
trypsin 197 ) and cannot hydrolyze peptides containing 
the C3bB cleavage site. 197 The structure of Factor D 
does not resemble an active serine protease, which 
explains these puzzling observations. 246 The SI site, 
S2— S4 sites and catalytic triad are deformed due to 
the presence of Ser94, Thr214, and Ser215 at posi- 
tions usually occupied by Trp/Tyr, Ser, and Trp/Phe, 
respectively. Substitution of these residues returns 
the catalytic triad to a normal conformation, but only 
increases the esterase activity 1 Ox. 247 The binding 
of C3bB must repair these defects and induce Factor 
D to assume the usual active conformation. 

C. Alpha Lytic Protease: Induced Fit as a 
Mechanism for Broad Specificity 

Induced fit also provides a mechanism for expand- 
ing specificity. 248 Imagine an enzyme has several 
conformations of comparable energies, each of which 
accommodates a different substrate or, more pre- 
cisely, the transition state for a different substrate. 
Each substrate will induce the appropriate conforma- 
tion for catalysis, and the enzyme can hydrolyze 
several substrates with comparable values of kc at /K m . 
Such an induced fit mechanism provides the struc- 
tural plasticity necessary for broad specificity. 



Table 9. Specificity of Wild Type and M192A Alpha 
Lytic Protease 3 







kcJKm M 




substrate 




wild type 


M192A 


Suc-Ala-Ala-Pro-Ala- 


-pNA 


2.1 x 10 4 


1 x 10 4 


Suc-Ala-Ala-Pro-Val- 


pNA 


790 


3 x 10 3 


Suc-Ala-Ala-Pro-Met- 


-pNA 


1800 


3.5 x 10 5 


Suc-Ala-Ala-Pro-Leu- 


-pNA 


4.1 


1.1 x 10 5 


Suc-Ala-Ala-Pro-Phe- 


-pNA 


0.38 


3.1 x 10 5 


a Data from ref 249. 









The broad specificity of the Met 192 Ala mutant of 
alpha lytic protease can be understood in terms of 
this structural plasticity (although induced fit is 
probably not an appropriate term since the catalytic 
machinery is always aligned). Alpha lytic protease 
prefers substrates with small aliphatic residues at 
PI. The SI site is small and relatively rigid, occluded 
by Metl92. Substitution of Metl92 with Ala would 
be expected to create a large hydrophobic SI site 
similar to chymotrypsin and the X-ray crystal struc- 
ture of the unliganded mutant protease shows that 
the SI site is enlarged. However, unlike chymo- 
trypsin, Met 192 Ala hydrolyzes PI— Ala and Phe 
substrates with comparable values of k cat /K m (Table 
9). 249 Inhibitor complexes show that the structure of 
the SI site of Met 192 Ala is not a simple hydrophobic 
pocket, but adapts to accommodate different PI 
residues. 251 Therefore, the SI site displays structural 
plasticity wherein each substrate induces the ap- 
propriate conformation for hydrolysis. The mutation 
appears to have a profound effect on the dynamic 
properties of the SI site. In wild-type alpha lytic 
protease, the walls of the SI site are coupled and 
move together as a unit so that the dimensions of 
the SI site are maintained. 252 However, the Metl92 
mutation disrupts this coupling, allowing the two 
walls of the S 1 site to move independently and the 
dimensions of the SI site to vary. 250 - 253 - 254 Thus, the 
mutant enzyme can sample many different confor- 
mations of the SI site, and hydrolyze many different 
substrates. 

Thrombin provides a second example of how speci- 
ficity can be expanded by the ability to access 
multiple conformations. Thrombin's role in the for- 
mation of a blood clot is well-known. 265,266 Less 
appreciated is thrombin's anticoagulation function. 
The substrates of the coagulation pathway (fibrino- 
gen, factor V, factor VIII, and factor XIII) contain 
neutral or positively charged P3 and P3' residues, 
while the substrates of the anticoagulation pathway 
(e.g., protein C) contain Asp at these positions. 
Thrombin requires two conformations to process such 
different subtrates. 255 - 256 The procoagulant conforma- 
tion, or "fast" form, hydrolyzes fibrinogen ~10 3 times 
faster than protein C. In contrast, the "slow" form 
has a 50-fold preference for protein C. 255 Importantly, 
Na + is an allosteric effector of this conformational 
switch, binding specifically to the fast form at a single 
site. 257 The fast and slow forms are approximately 
equal at physiological Na + concentrations. Na + is an 
allosteric effector of several other serine proteases, 
including Factor Xa, activated protein C, and Factor 
IXa; Na + regulation correlates with the presence of 
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Tyr or Phe at position 225, while Na + independence 
correlates with Pro225. 258 The Na + binding site lies 
between the 180 and 220 loops. 259 Substitution of 
Trp215 and Glu217 abrogates Na + binding, stabiliz- 
ing the slow form while Glu39, Trp60d, Glul92, 
Asp221, and Asp222 form an "allosteric core" that 
connects the Na + binding site to the active site cleft. 

IX. How Serine Proteases Work: Summary and 
Prospects 

Serine proteases are probably the most thoroughly 
investigated enzyme system. While the function of 
the catalytic triad and oxyanion hole can be rational- 
ized in terms of electrostatic stabilization of charges 
developing in the transition state, how remote bind- 
ing interactions facilitate catalysis remains a mys- 
tery. Catalysis and specificity are not simply con- 
trolled by a few residues, but are rather a property 
of the entire protein framework, controlled via the 
distribution of charge across a network of hydrogen 
bonds and perhaps also by the coupling of domain 
motion to the chemical transformation. Serine pro- 
teases present a unique opportunity to test these 
ideas. 

X. Useful Web Sites 

Rose, T., and Di Cera, E. Serine Protease Home 
Page. http : //www. biochem . wustl . edu/ ~ protease/ 
index.html. 

Rawlings, N. D., O'Brien, E. A., and Barrett, A. J. 
2002 MEROPS: The Protease Database. Nucleic 
Acids Res. 30, 343. http://merops.iapc.bbsrc.ac.uk/. 
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